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DNA COMPOUNDS COMPRISING SEQUENCES ENCODING MANNURONAN C-5-EPIMERASE 

The present invention concerns DNA compounds 
encompassing sequences coding for enzymes having 
mannuronan C-5-epimerase activity, a process for the 
preparation of such enzymes, the use of said genetic 
sequences in production of alginates having a definite G/M 
ratio and block structure, and the production of alginates 
having a definite G/M ratio by inactivating said genetic 
sequences . 

Throughout this application, reference is made to 
publications from the scientific and patent literature. 
Publications so cited are hereby incorporated in their 
entirety by reference. 

In this application the term gene is used to indicate a 
genetic sequence which encodes a protein, independent of 
whether the protein encoded by this genetic sequence is 
expressed or not in the natural host organism under those 
conditions . 

Alginates are a family of polysaccharides, which are 
synthesized in brown algae as well as in bacteria, such as 

ftzotofraqter vinelandii and Azotobacter chroococcum . 

Alginates are also synthesized by some strains of 
Pseudomonas sp. 

Chemically, alginates are unbranched binary copolymers of 
1-4 linked B-D-mannuronic acid, termed M, and its C-5 
epimer a-L-guluronic acid, termed G. 

Alginates derived from seaweeds and Azotobacter are 
generally true block copolymers where the monomers are 
arranged in homopolymeric stretches of M, termed M blocks, 
and homopolymeric stretches of G, termed G blocks, 
interspaced with regions containing both monomers, normally 
termed alternating blocks or MG blocks. The composition and 
sequential structure of alginates vary widely depending on 
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the source. Alginates produced by Pseudomonas . however, 
do not have any G blocks. 

Several functional properties such as the capacity to form 
gels and the binding of water depend on the M/G ratio and 
on the length of the various blocks, A relatively high 
content of G blocks, for instance gives good gelling 
properties, due to ionic cross linking of chains which 
takes place when Ca 2+ -ions are added to an alginate 
solution. The composition and block structure also 
influence on the immunological properties of alginates. 
[Otterlei et al, J. of Immunotherapy 10, 286-291, (1991)] 
have shown that alginates with a high content of mannuronic 
acid blocks are very potent nontoxic immunostimulants . 

At present industrial production of alginates rely 
exclusively on algal sources. The range in composition is 
however limited as the highest content of guluronic acid to 
be found is 75% and the lowest 25%. Furthermore there are 
no suitable sources for alginate with a G content in the 
range of 42-54%. In the field of biotechnology or 
biomedicine, polymers with extreme compositions, such as a 
high G for immobilization of cells, [Martinsen A., Skj&k- 
Braek G. and Smidsrod O. , Biotechnol. Bioeng. 33, 79-86, 
(1989)] and a high M (90-100%) as immunostimulants 
[Otterlei et al, J. of Immunotherapy 10, 286-291, (1991)] 
are of major interest. 

The key enzyme responsible for generation of the G blocks 
is called mannuronan C-5-epimerase. It was previously 
thought that only one enzyme having a certain amino acid 
sequence would exhibit this activity. It has now 
surprisingly been found that there exist at least five 
genes encoding enzymes having this activity. Some of these 
enzymes differ in molecular weight and amino acid sequence. 
The genes were found adjacent to each other in the 
bacterium Azotobacter vinelandii. It has also been found 
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that the amino acid sequence of the enzyme affects the 
activity of the enzyme, not only in terms of potency but 
also in the type of alginate formed, for example, altering 
the content of guluronic acid and the single/block G 
content of the alginate. 

In [Larsen, B. and Haug, A. , Carbohydr. Res. 17, 
(1971), 287-296 and 297-308] the isolation of mannuronan 
C-5-epimerase from liquid cultures of Azotobacter 
vinelandii is reported. In the following, this epimerase 
will be termed mannuronan C-5-epimerase (2) , and the DNA 
sequence encoding for it will correspondingly be 
denominated E2. 

In [Skj&k-Braek, G and Larsen, B, Carbohydr. Res. 103:133- 
136, (1982)], the purification of mannuronan C-5-epimerase 
(2) by affinity chromatography on alginate sepharose is 
disclosed. In a separate paper [Skj&k-Braek, G and Larsen, 
B., Carbohydrate Research, 139, (1985) 273-283] the 
characterization of this enzyme is disclosed. Further, the 
activity of the enzyme is described as an ability to 
epimerize both bacterial and algal alginate having a wide 
range in monomer composition and sequence of units. 

From PCT/WO 86/03781 and Japanese Patent Application 
J63233797 it is known to produce alginic acid and/or 
alginate having a high content of guluronic acid by action 
of the enzyme (E2) on an alginic acid or alginate, whereby 
the G content increases. 

In [Chitnis, C.E. and Ohman, D.E., J. Bacteriol., 172, 
P2894-2900, (1990)] the gene sequences involved in the 
introduction of guluronic acid into exopolysaccharides from 
Pseudomonas aeruginosa have been reported. However, the 
nature of the enzyme responsible for this process has not 
been identified. Since this genus of bacteria is unable to 
produce alginate containing G blocks [ Sk j &k-Braek , G., 
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Larsen, B. and Grasdalen, H. Carbohydr. Res. 54 (1986) 169- 
174] it is believed that the epimerization system in 
alginate producing Pseudomonas is fundamentally different 
from the epimerase in brown algae and in Azotobacter 
vinelandii . It seems likely that the Pseudomonas enzyme is 
a monomer epimerase acting at the sugar nucleotide level, 
and as such is unable to introduce G-blocks into already 
polymerized alginates. 

Production of mannuronan C-5-epimerase from Azotobacter 
vinelandii culture is difficult due to a very low yield. It 
is also a major obstacle that the enzyme is secreted 
together with copious amounts of highly viscous alginate 
which hampers the purification of the enzyme. Although 
alginates are secreted by some bacteria , an industrial 
production based on these microorganisms has not been 
successful* The main reasons are due to the difficulties in 
controlling the composition and molecular size of the 
exopolysaccharides. The content of guluronic acid blocks in 
the alginate from Azotobacter vinelandii tends to be too 
low for making a polymer with good gelling properties. 

Alginates with a high M content having immunogenic 
properties as reported above, are produced by Pseudomonas 
aeruginosa . but this organism is unattractive from a 
production point of view, as it is unstable in the 
production of the polymer. Further, the organism is known 
to be a secondary pathogen in patients suffering from 
cystic fibrosis. 

Thus, in order to produce medical grade alginates with a 
defined monomer composition and sequential structure there 
is a need for improved methods for controlling the 
biosynthesis of alginate, through controlling the key 
enzyme, the mannuronan C-5-epimerase. 

The present invention is directed to cloned DNA fragments 
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encoding mannuronan C-5-epimerase. The invention is 
encompassed by vectors which contain DNA fragments encoding 
mannuronan C-5-epimerase linked to DNA elements which 
direct the expression of mannuronan C-5-epimerase from the 
cloned DNA encoding the protein. The invention also 
provides for microorganisms which express the mannuronan 
C-5-epimerase protein from the cloned DNA as a source of 
the purified protein and also as a source of alginates of 
altered composition. Strains in which the expression level 
of the mannuronan C-5-epimerase gene is altered or in which 
one, several or all of the mannuronan C-5-epimerase genes 
have been inactivated are also within the scope of the 
present invention. The invention further encompasses 
methods for producing alginates either very efficiently, or 
having altered composition, or both, by culturing 
microorganisms having altered levels of expression of a 
mannuronan C-5-epimerase gene. 

The invention further features selection of epimerase to 
achieve a desired level of guluronic acid, and alter the 
single/block G characteristics of the enzyme. In a 

further embodiment, the invention features the production 
of synthetic proteins and DNA encoding such proteins which 
have mannuronan C-5-epimerase activity. 

Brief Description of Figures 

Figure 1 shows the amino acid sequence of the N-terminal 
end of the 122 kd protein , and the nucleotide sequence of 
the corresponding oligonucleotide. The DNA probe was 
synthesized as a mixture (in equal ratios) of the 64 
possible combinations that could be deduced from the first 
seven amino acids in the sequence of the 122 kd protein. N 
indicates that all four bases were used at this position. 

Figure 2 is a restriction endonuclase map of the combined 
inserts in plasmids pHE14, pHE16, pBDl, pHE18 and pMLl. 
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The numbers at the bottom line indicate the molecular sizes 
in bp. The arrow indicates the localization and orientation 
of the sequence homologous to the synthetic oligonucleotide 
used for screening the library. The five genes (open 
reading frames) found by sequencing are marked by boxes and 
denoted E4, El, E2, E3 and E5. El corresponds to 

Epimerase I. 

Figure 3 shows Mannuronan C-5-epimerase (1) activity of a 
portion of the El encoded protein as a function of cell 
growth. *: OD 600 of cell culture, o: Epimerase activity 
given as dpm/ml of cell culture. The strain used in this 
experiment was DH5a(pHE5), and the extracts were incubated 
with the substrate for 23 hours. 

Figure 4 shows the kinetics of 3 H release. The enzyme 
activity was assayed by using an extract prepared from 
IPTG-induced JM105 cells containing pHE5 (see legend to 
Table 3) . 

Figure 5 shows the homologies between and within the 
different genes. Boxes with the same letter are homologous 
to each other. Gaps are introduced to optimize the 
alignment. E1-E4 are defined as appears from Figures 2 
and 6. 

Figure 6 shows the nucleotide sequences and corresponding 
amino acid sequences for E4, El, E2 and part of E3. 

Figure 7 shows the alignment of the DNA sequences of the A 
blocks from E4, El, E2 and E3. 

Figure 8 shows the alignment of the deduced amino acid 
sequences of the A blocks from E4, El, E2 and E3. 

Figure 9 shows the alignment of the DNA sequences of the R 
blocks from E4, El, E2 and E3 . 



WO 94/09124 



PCT/NO93/00151 



7 

Figure 10 shows the alignment of the deduced amino acid 
sequences from the R blocks of E4, El, E2 and E3. 

Figure 11 shows 1 H-NMR spectra of alginate epimerized by 
extracts from A: DH5a(pHE8) (truncated epimerase 1); B; 
JM109(pBD9); C: no extract. The peak to the left gives 
the signal from 6-1; the peak in the centre gives the 
combined signal from GM-5 and M-l and the peak to the right 
gives the signal from GG-5. 

Figure 12 shows the nucleotide sequence and corresponding 
amino acid sequence of E2. 

Now according to the present invention genetic sequences 
have been found which encode enzymes having mannuronan C-5- 
epimerase activity, and thus the first aspect of the 
invention is pure isolated DNA comprising nucleotide 
sequences encoding mannuronan C-5-epimerase activity. 

The sequence of amino acids proximal to the amino terminus 
of purified mannuronan C-5-epimerase protein was determined 
[G. Skj&k Brak et al., Carbohydr. Res. 103:133-136 (1982)]. 
This data was used to derive a sequence for an 
oligonucleotide probe which was used to screen a gene 
library of Azotobacter vinelandii DNA. One result of this 
screening experiment was the surprising discovery of a 
second gene and thereafter three further genes including at 
least one genetic block A were found. Thus, altogether 
there appear to be at least five different genes encoding 
proteins having mannuronan C-5-epimerase activity. 
Accordingly, it is a second object of the present invention 
to provide for alternative DNA sequences encoding 
mannuronan C-5-epimerase. 

Three different blocks of genetic sequences, designated A, 
R and S, are found in the genes. These genetic blocks are 
most commonly found in combinations wherein the A appears 
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one time or two times, the R block appears from 0 to at 
least 5 times and the S block appears from 0-1 time. 

There is a high degree of consensus in the nucleotide 
sequences of each block for the different genes (1-5). 
Accordingly, it is a third object of the present invention 
to provide for DNA sequences encoding mannuronan C-5- 
epimerase and comprising the DNA blocks A and/ or S and /or 
R, wherein A may appear more than once and R if present may 
appear singly or in repeats of up to at least 5 or 6 times. 

The consecutive order of the three blocks if all three 
blocks are present, is preferably A, R and S. However, it 
has been shown that a reversed consecutive order, wherein 
for instance R appears before A also gives a gene encoding 
a mannuronan C-5-epimerase. Thus, the invention further 
encompasses genetic sequences having any order and any 
number of the blocks A, R and S. 

Another aspect of the present invention concerns the use of 
said genetic sequences for the preparation of the 
mannuronan C-5-epimerase in recombinant host cells. It is 
especially preferred to insert the gene into hosts such as 
bacteria, for instance Escherichia coli or Bacillus 
subtj.1 is or in yeast. The cloning and expression of the 
genetic sequence as described above in E. coli is described 
in the Examples. 

The present invention also encompasses recombinant 
expression plasmids that can be used to produce the 
mannuronan C-5-epimerase proteins in a host microorganism. 
Such expression plasmids are made by inserting a DNA 
fragment encoding mannuronan C-5-epimerase into a vector 
which contains appropriate expression elements, such as 
(but not limited to) a promoter, ribosome binding site, 
translational initiation site and transcription terminator. 
The expression plasmids can be adapted for transformation 
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into many different commonly used host organisms in which 
it might be desired to produce the mannuronan C-5- 
epimerase. 

The techniques for insertion of foreign genes into commonly 
employed hosts are known in the art, as described for 
instance in [METHODS IN ENZYMOLOGY, Vol. 185, Gene 
Expression Technology, Ed. D.V.Goeddel, Academic Press, 
Inc. (1990)]. Further by choice of a broad host range 
vector and a suitable promoter as known in the art, and 
described for instance in [J.L.Ramos et al, FEBS Letters, 
Vol. 226, 2, 241-246] it will be possible to insert and 
express the mannuronan C-5-epimerase genetic sequences in 
many different hosts. 

This will make possible the production of large quantities 
of one or all of the pure enzymes having this activity, 
while avoiding the problems of separating the enzymes from 
the alginate. 

By inserting a high copy-number vector comprising the 
genetic sequences encoding the epimerase into a natural 
alginate producing bacterium such as Azotobacter vinelandii 
an enhanced production of the enzymes would be possible. 

Over expression of the epimerases in a natural host could 
also be achieved by using a promoter which drives high- 
level expression of the enzymes. By blocking other genetic 
sequences coding for the alginate production, the 
production of pure enzymes may be achieved. 

Yet another aspect of the invention is the selective 
inactivation of the mannuronan C-5-epimerase genes in the 
natural host organism so as to provide for bacterial 
production of alginates having a low content of G blocks or 
even a pure poly-M alginate. This is accomplished by 
inserting nucleotides into one, several or all of the 
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mannuronan C-5-epimerase genes in the natural host organism 
Azotobacter . It is especially preferred to insert a DNA 
fragment encoding a selectable marker gene, preferably a 
gene conferring antibiotic resistance. Insertion of a 
selectable marker allows selection of those bacteria in 
which the insertion has been successfully accomplished. By 
choosing different selectable markers , for example 
providing resistance to different antibiotics, it is 
possible to select recombinants that have incorporated 
inserted sequences into some or all of the mannuronan C-5- 
epimerase genes. Thus, selective production of bacterial 
strains in which one of the mannuronan C-5-epimerase genes, 
several or all of them have been inactivated is possible, 

A second method of inactivating all of the epimerase genes 
is to transform a cell of the natural host strain, 
Azotobacter with a vector which expresses an antisense RNA 
which specifically binds to mRNA transcribed from these 
genes. Use of promoters of varying strength to* drive 
expression of the antisense RNA in the creation of the 
vectors used to transform the cells allows production of 
strains having varying ratios of G-blocks to M-blocks in 
the alginate produced. Use of inducible promoters to drive 
expression of the antisense RNA allows the creation of 
strains which can produce alginates of variable 
composition, depending on culture conditions. Clearly, if 
the recombinant host organism is the natural host, 
Azotobacter . it is possible to enhance production of one 
epimerase gene while leaving expression of the others at 
their normal level, thus producing a strain which makes an 
alginate having an altered ratio of G blocks to M blocks. 
A strain which makes alginate having 0-25% M blocks is 
preferred. 

Alternatively, all but one of the epimerase genes can be 
inactivated, as described above, and the expression of the 
remaining epimerase gene can be controlled by a regulated 



WO 94/09124 



PCT/NO93/001S1 



11 

promoter. A strain carrying such a complement of epimerase 
genes would thus produce alginates having a high content of 
G-blocks, especially from 75-98%. Another means for making 
a strain for producing alginates having a high G-block 
content is to inactivate all but one of the mannuronan 05- 
epimerase genes by insertion and control the remaining gene 
by antisense RNA, using an inducible promoter to regulate 
transcription of the antisense RNA gene. A still further 
means for making a strain for producing alginates having a 
high G-block content is by inactivating all naturally 
occurring genes and introducing a regulated gene through a 
vector . 

Thus the present invention also includes a process for the 
construction of a recombinant host cell capable of 
expressing mannuronan C-5-epimerase activity by 
transforming said host cell with a recombinant DNA 
expression vector that comprises: (a) a promoter and 
translational activating sequence that function in said 
host cell; and (b) a DNA sequence encoding mannuronan C-5- 
epimerase comprising at least a DNA block A and/ or a DNA 
block S and/ or a DNA block R, positioned for expression 
from said promoter and translational activity sequence. 

Also the present invention encompasses a process for the 
bacterial production of pure poly-M alginate or tailored 
alginates having a lower G block content, preferably in the 
range from 0-25%, by blocking the DNA sequences encoding 
the enzymes in a natural host by insertion of a foreign 
genetic sequence into one, several or all genetic sequences 
encoding mannuronan C-5-epimerase. 

Other methods for achieving the same end will be known for 
persons skilled in the art and are hereby included into the 
scope of the present invention. 

A further aspect of the invention are the novel enzymes 
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having mannuronan C-5-epimerase activity. The amino acid 
sequences and their degree of homology will appear from 
Figures 6-11. 

Also as known by a person skilled in the art, variations in 
the nucleotide sequence which nevertheless encode proteins 
having the same activity as the wild-type mannuronan C-5- 
epimerase are encompassed within this invention. 

Variations within the amino acid sequence may also 
encompass deletions, substitutions and additions which do 
not substantially change the biological activity. 

Also, it is possible to make a synthetic DNA sequence 
encodinq a mannuronan C-5-epimerase by techniques well 
known in the art. See for instance, [Itakura et al., 
Science 198:1056 (1977)] and [Crea et al. (Proc. Natl. 
Acad. Sci. USA 75:5765 (1978)] and also U.S. Patents 
4,800,159 and 4,683,202 and also published European patent 
application EP-A-0258017. Synthetic enzymes may be made 
by incorporating different combinations of the A, R and S 
elements, to maintain epimerase activity. The resultant 
alginate composition can be varied by enzyme selection. 

Materials and General Methods 

Bacterial strains, plasmids, and phage. Strains, plasmids, 
and phages are listed in Table 1. 

The bacterial strain of A, vinelandii used in these 
experiments, is freely available from Bjorn Larsen, Inst, 
of Biotechnology, Lab. for Marine Biochemistry, 7034 
Trondheim - NTH, Norway or Svein Valla, Unigen, Center for 
Molecular Biology, University of Trondheim, 7005 Trondheim, 
Norway and has been deposited at the Belgian Coordinated 
Collections of Microorganisms (BCCM) at the Laboratorium 
voor Microbiologie (IMG) at Universiteit Gent (RUG), K.L. 
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Ledeganckstraat 35 , B-9000 Gent. Other strains of 

A.vinelandii mentioned in Example 9 have the following ATCC 
numbers: ATCC 478, ATCC 12837 and ATCC 12518. Plasmids/ 
strains DH5a(pHE14) , JM109(pHE16) , JM109(pBDl) , 
JM109(pHE18) and SURE™(pMLl) have been deposited at BCCM at 
the Laboratorium voor Moleculaire Biologie (LMBP) (same 
address as LMG) and have the following numbers 

Growth of bacteria and phages. A. vinelandii was grown at 
30° C with shaking in a nitrogen-free medium (9.8 mM 
K 2 HP0 4 /KH 2 P0 4 , 0.8 mM MgS0 4 7H 2 O, 3.4 mM NaCl, 0.34 mM CaCl 2 , 
8.7 ^ Na 2 Mo0 4 2H 2 0, 54 /iM FeS0 4 7H 2 0, 1% sorbitol, pH 7.4). 
E. coli was grown in LB-medium [Sambrook J, Fritsch, E.F. 
and Maniatis T. r Molecular cloning, A laboratory manual, 
2nd ed. , Cold Spring Harbour Laboratory Press, New York, 
(1989)] with shaking at 37°C. When the cells were to be 
used for growth of phages the LB-medium was supplemented 
with 2.5 mM CaCl 2 , 10 mM MgCl 2 , and 0.4% maltose. Phages 
were plated on strain Q359 on L-agar (LB-medium 
supplemented with 2% agar) . Phage LB-medium supplemented 
with either 0.8% agar (titrations and gene library 
amplification) or 0.8% agarose (screening of gene library 
and preparation of phage lysates) was used for overlaying 
agar. 

Standard recombinant DNA technology. Restriction 
endonuclease digestions, removal of cohesive DNA ends by 
using the 3' exonuclease activity of T4 DNA polymerase, 
ligations, agarose gel electrophoresis, and end-labelling 
with 32 P were performed according to standard protocols 
[Sambrook J, Fritsch, E.F. and Maniatis T ., Molecular 
cloning, A laboratory manual, 2nd ed. , Cold Spring Harbour 
Laboratory Press, New York, (1989)]. Transformations were 
performed as described by [Chung, C.T., Niemela S.L. and 
Miller R.H., Proc. Natl. Acad. Sci USA, 86, 2172-2175, 
(1989)], and DNA sequencing was performed according to 
[Sanger F., Nicklen S. , and Coulsom, . A.R., Proc. Natl. 
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Acad- Sci USA, 74, 4563 (1977)]. 

Viscosimetric measurement. The alginate used in this 
experiment was obtained from Ascophyllum nodosum and had an 
intrinsic viscosity in 0.1 M NaCl of 17.6 dl/g at 25°C. The 
viscosity was determined by an Ubbelhode viscosimeter . 

NMR spectroscopy. The substrate used in these analyses was 
a low guluronic acid-containing alginate obtained from the 
brown algae Ascophvllum nodosum . and was prepared as 
described previously [Larsen, B., Proceedings of the Tenth 
International Seaweed Symposium, Ed: Levring, T.Gothenburg, 
p7-33, (1980)]. For the NMR analyses epimerase was obtained 
from IPTG-induced E. coli JM105 cells containing pHE5. 250 
ml cell culture were harvested by centrifugation and 
resuspended in 20 ml of 10 mM Tris, 0.34 mM CaCl 2 , pH 7.0. 
After ultrasonication, the solution was centrifuged at 
31.000 x g for 1 hour. The supernatant was stored frozen at 
70°C. After thawing the supernatant was filtered through a 
membrane with pore size 0.22 pm, and the enzyme was further 
purified on a Mono Q HR515 (Pharmacia) ion exchange column. 
The enzyme was eluted with a 0-1 M NaCl salt gradient (in 
the same buffer as the applied solution) , and was collected 
in 2 ml at approximately 0.6 M NaCl. To each of two tubes 
was added 0.28 ml of this enzyme solution (0.9 mg/ml total 
protein), 1 ml alginate (7.5 mg/ml in H 2 0) , and 4.62 ml 
2,3,6-trimethylpyridine buffer (see above). CaCl 2 was then 
added to a total reaction volume of 6 ml such that one tube 
contained 0.85 mM, and one contained 3.4 mM CaCl 2 . After 
incubation at 30°C for 20 hours Na 2 EDTA (10 mM) was added 
to chelate the Ca 2+ -ions, and the solutions were then 
dialyzed extensively against distilled water. The dialyzed 
alginate solutions were freeze-dried and then dissolved in 
D 2 0. NMR spectroscopy of these solutions were finally 
performed according to [Grasdalen H. , Larsen B. , and 
Smidsrod O. , Carbohydr. Res., 68, 23-31 (1979)] (Table 4). 
Further analysis was carried out in a similar fashion for 
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DH5a(pHE8) , JM109(pHE16) and JM109(pBD9). The results in 
Table 4 conclusively demonstrate that the enzymatic 
activity is mannuronan C-5-epimerase activity. This 
activity is expressed from a number of the plasmids showing 
that an entire epimerase gene/protein is not required in 
order to maintain epimerase activity. The epimerase 
activity is Ca 2+ dependent. 

Example 1 

Purification of mannuronan C-5-epimerase (1), partial amino 
acid sequencing and synthesis of a mixed DNA probe. The 
enzyme was isolated from liquid cultures of A. vinelandii 
essentially as described in [Skj&k-Braek, G. and Larsen, B. 
Carbohydrate Research, 103, (1982) 137-140]. The cells were 
removed by centrifugation and the enzyme was isolated by 
precipitation with 30% ammoniumsulphate and followed by 
centrifugation for 20 min. at 10000 rpm. The supernatant 
was then precipitated with 50% ammonium sulphate .(final 
concentration) , and the precipitate after centrifugation 
was dissolved in 0.05 M imidazole/HCl (pH 6.8) containing 
0.34 mM CaCl 2 and 0.5 mM dithiothreitol . This crude extract 
was then desalted on a prepacked column (PD-10) of Sephadex 
G-25 (Pharmacia) equilibrated with the same buffer. The 
extract was then applied on an alginate-Sepharose column. 
Proteins bound by non-specific interactions were eluted 
with 0.1 M NaCl. The epimerase was eluted as a sharp peak 
with 0.5 M NaCl.To make the enzyme pure enough for protein 
sequencing, it was dialyzed against TE-buffer overnight, 
freezedried, and further purified by SDS-PAGE gel 
electrophoresis (7.5% polyacrylamide in 25 mM Tris, 192 mM 
glycine, 0.1% sodium dodecyl sulphate, pH 8.3.) followed by 
electroblotting (in electrophoresis buffer without sodium 
dodecyl sulphate) onto a polyvinylidene difluoride 
membrane, poresize 0.45 pm (Millipore) . The membrane was 
stained with Coomassie brilliant blue and air dried, and 
the protein with Mw 122 kd was cut out for N-terminal 
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sequencing on a model 477A protein sequencing apparatus 
from Applied Biosystems. A DNA oligonucleotide was 
synthesized on the basis of the amino acid sequence 
information, and this oligonucleotide was used as a probe 
for screening of the gene library after end-labelling with 
32 P by polynucleotide kinase. 

Example 2 

Isolation of DNA from A. vinelandii and construction of a 
gene library. A. vinelandii cells were harvested and washed 
once in 0.9% NaCl. They were then lysed according to 
[Hansen, J.B. and Olsen, R.H. , J. Bacteriol. , 135, 227-238, 
(1978)], and the lysate was extracted twice with phenol and 
twice with chloroform. Nucleic acids were precipitated with 
ethanol, and the DNA was collected on a glass rod and 
dissolved in TE-buffer (lOmM Tris, ImM Na 2 EDTA, pH 7.9). 
Further purification was obtained by CsCl/ethidium bromide 
density gradient centrifugation. After removal of the 
ethidium bromide by isopropanol extraction, the DNA 
solution was dialyzed against TE buffer . 

The DNA (molecular size greater than 60 kb) was subjected 
to partial Sau3AI digestion under conditions maximizing the 
generation of 15-20 kb fragments. After ethanol 
precipitation the DNA was dissolved in 40 ^1 TE buffer to 
give a concentration of 0.5 pqf jxl. The DNA was then 
dephosphorylated with calf intestine phosphatase, followed 
by inactivation of the enzyme by incubation at 75 °C for 10 
minutes in the presence of 10 mN nitrilotriacetic acid. The 
dephosphorylated DNA was precipitated with ethanol and 
dissolved in 40 /il 0.1 x TE buffer. 

EMBL3 vector DNA was digested with BamH I + EcoRI, followed 
by an isopropanol precipitation step under conditions 
leaving the short BamH I / Eco RI oligonucleotides in solution 
[Frischauf A., Lehrach H., Poustka A. and Murray N., J. 
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Mol. Biol., 170, 827-842, (1983)]. The Sau3AI -digested and 
dephosphorylated A. vinelandii DNA (1.75 pq) was then 
ligated with the BamH I / Eco RI-dicrested vector DNA (4.75 /xg) , 
using T4 DNA ligase in a total reaction volume of 20 /xl. 
After ligations over night at 10 °C, 10 jil of the ligation 
mixture was subjected to in vitro packaging in a Promega 
Biotech packaging system. The in vitro constructed phage 
particles were titrated on the E. coli strain Q359, and the 
library was finally amplified on Q359 in one cycle by 
plating on solid medium. Screening of the library was 
performed according to standard protocols [Sambrook, J., 
Fritsch, E.F. and Maniatis, T., Molecular Cloning, 
Laboratory Manual, 2nd Ed., Cold Spring Harbour Laboratory 
Press, (1989)], except that the highest stringency wash was 
3.2 M tetramethylammoniumchloride at 50 °C. A total of 1.4 
x 10 5 primary recombinant phages were constructed, a library 
complexity far above what is required for obtaining 
representativity of A. vinelandii genes. 

Example 3 

Measurements of epimerase activity from mannuronan C-5- 
epimerase (1) . (5- 3 H) alginate was prepared as described in 
[Skj&k-Braek, G. and Larsen, B., Carbohydrate Res., 103, 
133-136, (1982) ] . The (5- 3 H) alginate was produced by growing 
Azotobacter vinelandii in a medium consisting of D-Glucose 
(20g) , K 2 HP0 4 (lg), MgS0 4 .7 H 2 0 (200mg) , FeS0 4 . 7 H 2 0 
(50mg), NaMo0 4 . 2 H 2 0 (5mg) , NH 4 0Ac (2.3g) and CaCl 2 . 2 H 2 0 
(50mg) diluted to one litre with water. The cells were 
grown at 30 °C with vigorous shaking. After 30 hours, D-[5- 
3 H]glucose was added to a concentration of 0.6mg/ml 
(Specific activity, 0.7 /iCi/mg) and the cells were allowed 
to grow for another 72 hours. The culture was cooled in an 
ice-bath, and the cells were removed by centrifugation. The 
supernatant solution was dialysed against 0.05M sodium EDTA 
(3x5 litres) for 24 hours followed by exhaustive dialysis 
against distilled water. The sodium alginate was then 
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precipitated with ethanol in the presence of 0.2% of sodium 
chloride. The specific activity of the label was 29 000 
dpm/mg alginate. The composition of this alginate was also 
analyzed by NMR spectroscopy, and it was found to contain 
59% mannuronic acid. Phage lysates were prepared by plating 
105 phages per plate. Two ml 2 , 3 , 6-trimethylpyridine buffer 
(50 mM, pH 6.9) were added to each plate, and the 
softagarose/buf fer mixture was scraped off, vortexed and 
centrifuged at 10 000 rpm for 10 min. The supernatant was 
used for incubations with (5- 3 H) alginate by mixing 0.25 ml 
(5- 3 H) alginate (2.5 mg/ml) , 6,3 pi 0.1 M CaCl 2 , and 1.45 ml 
phage lysate. The mixture was incubated at 30°C overnight, 
and the alginate was precipitated by addition of 15 /*1 5 M 
NaCl and 2 ml ethanol. After incubation at -20°C for 30 
min. the solution was centrifuged at 10.000 rpm for 30 
min., and 1 ml of the supernatant was used for 
determination of released 3 H [Skj&k-Brask, G. and Larsen, B. , 
Carbohydrate Res., 103, 133-136, (1982)]. in a liquid 
scintillation counter. 

For measurements of epimerase activity as release of 3 H in 
cells containing recombinant plasmids, the cell cultures 
were harvested by centrifugation and resuspended in 2,3,6 
trimethylpyridine buffer. When IPTG (3 mM) was used for 
induction of the lac-promoter, the inducer was added to 
exponentially growing cells and incubations were continued 
for 3 hours. Cells were disrupted by ultra-sonication, and 
varying amounts of the lysates were incubated with shaking 
together with 100 jzl (5- 3 H) alginate (2.5 mg/ml) and 400 /il 
2,3, 6-trimethylpyridine buffer (total volume 0.6 ml) in the 
presence of 3.3 mM CaCl 2 . The quantities of enzyme- 
containing cell extracts used were adjusted such that the 
measurements were performed under conditions where the 
enzyme represented the limiting factor. After incubation at 
30°C at the times indicated in each case, the mixtures were 
precipitated with ethanol under the conditions described 
above for phage lysates and 1.0 ml of the supernatant was 
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used for scintillation counting. Controls (using the 
appropriate host with the pUC18 vector) gave low 
backgrounds and these numbers were subtracted in the values 
presented in Table 3. 

Example 4 

Molecular cloning of a DNA fragment expressing a mannuronan 
C-5-epimerase activity in E. coli . The A. vinelandii gene 
library was constructed by cloning partially Sau3AI- 
digested A.vinelandii DNA into the bacteriophage X vector 
EMBL3 • In order to identify the epimerase gene in this 
library, we constructed a DNA probe based on the assumption 
that the previously purified 122 kd protein represented the 
epimerase [Skj&k-Braek, G. and Larsen, B. , Carbohydr. Res., 
103, 137-149 (1982)]. Initially we tried to use the 
corresponding protein solution for determination of the N- 
terminal amino acid sequence of the 122 kd protein, but the 
results showed that the preparation was not sufficiently 
pure for this purpose. We therefore purified the protein 
further by SDS-polyacrylamide gel-electrophoresis, followed 
by electroblotting onto a membrane. The band containing the 
122 kd protein was cut out from this membrane and subjected 
to N-terminal amino acid sequence analysis. Based on parts 
of this sequence, we synthesized the mixed DNA probe shown 
in Figure 1. 

The DNA probe synthesized as in Example 1 was labelled with 
32 P and then used for screening of the A. vinelandii gene 
library. Clones which hybridized reproducibly against the 
labelled probe were identified at a frequency of 
approximately 1(T 3 , and six such clones were selected for 
further studies. Phage lysates were prepared from each of 
the six clones, and each lysate was assayed for epimerase 
activity (Table 2). As can be seen, the lysates prepared 
from all six clones appeared to contain a weak enzyme 
activity that could represent the epimerase. This 
conclusion was further supported by the observation that 
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control lysates prepared from randomly picked recombinant 
phages in the library, gave reproducibly lower activity, 
representing the background activity. 

Example 5 

Subcloning of a DNA fragments encoding the epimerase. DNA 
from phage EP2 was partially digested with Sau3AI, and 
fragments ranging from 4 to 9 kb in size were subcloned in 
the BamH I site of plasmid pUC18. Cell extracts from DH5cr 
transformants containing recombinant plasmids were assayed 
for epimerase activity, and the corresponding plasmids were 
also hybridized against the synthetic oligonucleotide used 
for screening of the gene library. The analysis of the cell 
extracts showed that one of them contained an enzymatic 
activity consistent with the assumption that a polypeptide 
having epimerase activity was expressed from the plasmid 
(pHEl) in this clone (see Table 3). We have also tried to 
centrifuge the extract at 30000 g for 3.5 hours, and. found 
that the activity was not significantly reduced in the 
supernatant. Since we were unable to detect any significant 
activity in the culture medium, we conclude that the 
epimerase is localized intracellular ly in E.coli . The 
insert in pHEl also hybridized against the synthetic 
oligonucleotide used for screening, and pHEl was therefore 
selected for further analysis. 

Example 6 

Characterisation of the cloned DNA required for expression 
of the epimerase, and stability of the enzyme in vivo and 
in vitro. The insert in pHEl is approximately 4 kb in size, 
and Figure 2 shows the restriction map of this insert. 
Hybridization analysis of pHEl with the original synthetic 
oligonucleotide showed that the sequence hybridizing to the 
oligonucleotide was localized downstream of the SphI site. 
The hybridizing sequence was further characterized by DNA 
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sequencing , and this analysis showed that one of the 
potential reading frames of the sequence was in 100% 
agreement with the original N-terminal amino acid sequence 
of the 122 kd protein. Surprisingly , however, the 
orientation of the sequence was such that it would be 
transcribed out of the cloned fragment (see Figure 2). This 
result thus indicated that the observed epimerase activity 
was not correlated with the sequence encoding the 122 kd 
protein, a conclusion that was further confirmed by the 
observation that the terminal 0.5 kb SphI fragment could be 
deleted (generating plasmid pHE7) without loss of the 
epimerase activity from the corresponding cell extract. In 
addition to the SphI deletion, we deleted (from pHE7) the 
0.7 kb Konl fragment at the opposite terminus of the 
insert, generating plasmid pHE5. As shown in Table 3, pHE5 
(in DH5a) expressed the epimerase at a level approximately 
27 times higher than the level of expression from pHEl. 

During the expression studies described above we found that 
the measurements were quantitatively difficult to reproduce 
unless the time of harvesting the cells were kept as 
constant as possible. We have analyzed this problem more 
carefully by measuring the enzyme activity at different 
stages of growth of the E. coli cells. The results of such 
an analysis are shown in Figure 3, and indicate that the 
enzymatic activities in the cell extracts are drastically 
reduced shortly after the cells have entered the stationary 
phase. To obtain optimal enzyme yields it is therefore 
important to harvest the cells at the end of the 
exponential phase or at the beginning of the stationary 
phase. The reason for the reduction of epimerase activity 
might potentially be due to proteolysis of the epimerase in 
stationary phase cells. To study the stability of the 
enzyme in vitro we have also analyzed the kinetics of 3 H 
release in the DH5a(pHE5) extracts. As can be seen from 
Figure 4, the enzyme activity is linear over at least 30 
hours, demonstrating that the enzyme is very stable in 
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vitro. A critical parameter for obtaining reproducible 
results is thus the time of harvesting of the cells. 

Example 7 

Stimulation of the epimerase activity by induction of the 
lac promoter. The results described above showed that the 
levels of expression of the epimerase from pHE5 was 
significantly higher than in pHEl. The reasons for this 
could potentially be that the lac promoter was important 
for the expression , and we have therefore analysed this 
problem more closely. The analyses were performed in the JL_ 
coli strain JM105, a strain which expresses high levels of 
lac repressor, thus allowing a more repressed state of the 
promoter under uninduced conditions. When cell extracts 
prepared from uninduced and induced (with IPTG) cells of 
JM105(pHEl), a significant stimulation of enzyme activity 
was observed in the induced cells (Table 3) . A similar 
experiment using JM105(pHE5) showed even greater 
stimulation of the expression of the epimerase upon 
addition of IPTG in this case. These experiments thus 
showed that the lac promoter probably is a key element , 
although not necessarily the only element , involved in the 
expression of the epimerase from pHE5. The experiments in 
addition showed that the direction of transcription is from 
the Kpnl towards the SphI site in the insert. The epimerase 
gene is therefore transcribed in the same direction as the 
gene encoding the 122 kd protein, whose N-terminal amino 
acid sequence was used for the isolation of the cloned DNA. 

Preliminary experiments on deleting more DNA from the SphI 
side of the insert indicated that very little could be 
deleted without loss of the epimerase activity. At the Kpnl 
side, on the other hand, we found that significant 
deletions were tolerated. Table 3 shows the results of 
analysis of expression of the epimerase from a plasmid 
(pHE8) constructed by deleting the 0.8 kb KpnI/SacII 
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fragment from pHE5. As can be seen, this deletion resulted 
in a very strong stimulation of the epimerase activity both 
in uninduced and induced cells. The expression from pHE8 is 
presumably based on initiation of translation from the 
Shine-Dalgarno sequence in the vector (localized between 
the lac promoter and the polylinker) . Similarly, high 
levels of expression were obtained from pHE22 also due to 
the coding sequences being in frame with the Shine-Dalgano 
sequence. So far we have not obtained expression of the 
epimerase in constructs where deletions have extended 
beyond the SacII site. 

Example 8 

Use of a different promotor than the lac promotor. The 
insert in pHE5 (EcoRI-Hindlll) was sub-cloned into plasmid 
pT7-3 (a derivative of pT7-l described by [Tabor, S., and 
C.C. Richardson (1985). Proc. Natl. Acad. Sci. 82, 1074- 
1078]) , and the new plasmid was designated pLBl. The insert 
in pLBl is localized downstream of the <pl0 promoter in the 
vector. This promoter is only recognized by the 
bacteriophage T7 RNA polymerase, and expression of genes 
downstream of this promoter thereby becomes dependent on 
expression of this polymerase activity in the cells. The 
442 bp (see Figure 2) Sad - Spo l fragment was finally 
deleted from the insert in pLBl, generating plasmid pLB2. 
pLB2 was transformed into E. coli K38 (pGPl-2) . Plasmid 
pGPl-2 encodes the gene for T7 RNA polymerase, and the 
expression of the gene is controlled by a temperature 
inducible repressor. K38(pLBl, pGPl-2) was grown in 
exponential phase at 30° C for 4 1/2 hours. One of two 
parallel cell cultures was then transferred to 42 °C for 30 
minutes to induce the T7 -polymerase. The other parallel 
cell culture was grown at 3C°C for 5 hours. The epimerase 
activities in the cells were measured as described in 
example 3, and the results of the measurements are shown in 
Table 3 . 
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Example 9 

Cloning of mannuronan C-5-epimerase (2). Plasmid pHE12 was 
constructed by inserting a 6,2 kb Xho l fragment from the 
recombinant bacteriophage lambda derivative EP2 into 
pUC128. As can be seen from Figure 2 the insert in pHE12 is 
partly overlapping with the insert in pHEl. Analysis of 
extracts prepared from cells containing pHE12 (as described 
for pHEl) , showed that they expressed mannuronan C-5- 
epimerase activity (Table 3). Further analysis showed that 
the 2.5 kb Spo l- Xho l fragment could be deleted from the 
insert in pHE12 without affecting the expression of 
mannuronan C-5-epimerase. Further plasmids were 

constructed (see Figure 2) and the activity analysed (see 
Table 3) . This demonstrated that both the genes and gene 
fragments were able to express epimerase activity. The 
nucleotide sequences of the inserts were determined by the 
method of Sanger [ Sanger ,F., S. Nicklen, and A.R. Coulson. 
1977. Proc. Natl. Acad. Sci. 74, 5436]. The nucleotide 
sequences are shown in Figure 6. 

Example 10 

Sequence Comparison. Five genes have been identified as 
shown in Figure 2. The insert containing E5 is located 
about 5-10 kilobases away from the other genes. Figure 6 
shows the nucleotide sequence for the complete genes of E4, 
El, E2 and a large portion of E3. Detailed analysis of 
the nucleotide and amino acid sequences revealed highly 
homologous regions within each gene and between the various 
genes. Figure 5 characterises each of the genes by 
reference to the homologous blocks. Each of the genes has 
at least one A-element and at least one R-element. 

El, E2 and E4 all end with a reasonably homologous sequence 
termed the S-element (not shown in Figure 5) . The last 14 
amino acids of the S-element of El and E2 are identical 
with one exception. 
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Figures 7-10 show detailed analysis of the A- and R- 
elements within each gene by reference to the consensus 
sequence (con). Each A-element is approximately l r l50 
base pairs long and each R-element is approximately 450 
base pairs long. Short oligonucleotides are present in 
El, E2 and E3 between the second and thirds-elements. 
Gaps have been introduced where necessary to maximise 
alignment (see in particular the third R-element of E2) . 

Hybridization with a probe made from the first part of the 
A-element to a Southern blot of A. vinelandii digested with 
restriction endonuclease Bglll gave 5 distinct bands. One 
of these bands contained two A blocks, and another of these 
bands contained two different fragments with the same size. 
The number of bands were the same when other strains (ATCC 
478, ATCC 12837 and ATCC 12518) of the same species were 
used. This implies that the bacterium contains at least 5 
copies of the A-element, and that this is common to several 
independently isolated strains of A. vinelandii . 

The first part of each R-element contains six perfect and 
imperfect repeats of a nonapeptide with the consensus 
sequence LXGGAGXDX , except for the third R-element of E2 
which lacks two of these repeats. Figure 12 shows the 
complete nucleotide and corresponding amino acid sequence 
of E2. The nonapeptides have been marked with double 
lines for a good match with the consensus sequence and 
single lines for less good matches. This nonapeptide 
motif is characteristic of the haemolysin family of 
secreted proteins (Suh, Y. and Benedik, M.J., J. Bacterid 
174, (1992) 2361-2366). These proteins are all calcium 
dependent and are secreted by a pathway which does not 
involve cleavage of an N terminal signal peptide. For 
haemolysin secreted from E.coli it has been proposed that 
the nonamers are responsible for the binding of calcium 
ions (Ludwig, A. et al, Mol. Gen. Genet. 214, (1988) 553- 
561, Boehm, D.F. et al, Infect. Immun. 58 (1990) 1959- 
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1964). It appears that the R-elements are involved in 
calcium ion binding, calcium being necessary for both 
enzyme activity and gel formation. 

Example 11 

Making an altered epimerase. 

As can be seen from Table 3, various elements may be 
deleted from the gene, while maintaining the expression of 
a protein having epimerase activity. Clearly, the latter 
portion of El having the sequence ARS has epimerase 
activity (see plasmid pHE8) , although deletions in A2 of El 
are not tolerated (see Example 7) . Additionally, E2 
having the sequence ARRRRS also demonstrates epimerase 
activity. Additionally, fragments of E3 lacking a carboxy 
terminal and having the sequences ARRR and ARRRARR have 
expressed epimerase activity. (See plasmids pH18 and 
pBD6) . Accordingly, it appears that an S-element is not 
essential for epimerase activity, although the presence of 
this element may affect activity. We therefore postulated 
that an epimerase may need at least one A-element and at 
least one R-element, and that it should be possible to make 
altered epimerases by combining these elements in different 
ways. To show this, we constructed a plasmid encoding an 
epimerase with the sequence RARS: 

The insert in pHEl ( Eco RI- Hind lll ) was subcloned into 
plasmid pTrc99A (Pharmacia), generating plasmid pHE21. 
This plasmid contains a trc-promoter in front of the 
epimerase I gene, a strong transcription termination signal 
downstream of the gene, and the lacl^-gene allowing 
induction with IPTG. pHE21 was digested with Kpn l and 
Spo l, made blunt-ended with SI nuclase and religated. The 
resulting plasmid, pHE22, expresses a protein having the 
carboxy terminal of epimerase 1, RARS. The epimerase 
activity was measured as in Example 3, see Table 3. 
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Given that epimerase activity is expressed from a number of 
the constructs, it seems likely that a number of synthetic 
enzymes may be produced having epimerase activity including 
differing numbers of A, R and S blocks. The presence of 
activity in pHE22 implies that it is not essential to have 
an amino-terminal A block, and so block order may also be 
altered. 

Example 12 

The 1 H-NMR spectra of alginate epimerased by extracts from 
plasmids pHE8 and pBD9 show that the proteins encoded by 
these plasmids have different enzyme activity, pHE8 
producing epimerase with single G activity while pBD9 
producing epimerase with G block activity. pHE8 encodes 
the carboxy terminal ARS of El whereas pBD9 encodes ARRRRS 
of E2. The naturally encoded epimerases may therefore 
have differing activity particularly in the distribution 
patterns of Gs. The different activity of the various 
epimerases encoded within the 5 genes could be used to 
create alignates having a desired structure, by selectively 
expressing a desired gene or genes. Alternatively, it may 
be possible to construct synthetic enzymes varying the A, 
R and S block content of each epimerase to provide enzymes 
having altered activity providing a further level of 
control in the production of desired alginates. 
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Table 1-1 Bacterial strains, phages, and plasmids. 



Strain/phage/ 
plasmid 



Remarks 



References 



Bacterial strains 

A. vinelandii Strain E 



E. coli 
Q359 

DH5cc 



JM105 



JM109 



TM 



SURE 



Phages 
EMBL3 



supE hsdR 08O r P2 

supE 44AlacU169 (080 lacZ AM15) 
hsdR 17 recAl endA l ovrA 96 
thi-1 relAl 

supE endA sbcB 15 hsdR 4 rpsL 
thiA ( lac-proAB) 

recAl supE 44 endA l hsdR 17 
ovrA 96 relAl thi A ( lac-proAB ) 
Ff traD 36 proAB ^lac I ^lacZAM15 1 
e!4" ( mcrA l . A ( mcrCB-hsdSMR-mrr ) 
171 , endAl, supE44 , thi^l, 
qyrA96 , relAl, lac, recB, recJ, 
sbcC, umuC:Tn5(kan r ) , uvrC, [F' 
proAB , lacI^ZAM15, TnlO, (tet r )] 



Larsen and Haug 
(1971) 

Karn et al. (1980) 

Bethesda Research 
Laboratories (1986) 

Yanisch-Perron 
et al (1985) 

Yanisch-Perron 
et al (1985) *' 

Greener (1990) 



Bacteriophage X vector used for 
construction of A. vinelandii 
gene library 
EPx Randomly picked phage from 

A. vinelandii gene library 
EP2, -3, -6, Phages isolated from A. 
-7, -8 and -9 vinelandii gene library 

Identified by hybridization and 
expresses mannuronan C-5-epimerase 



Frischauf et al. 
(1983) 



See examples 
See examples 



Plasmids 

pUC18 Ampicillin resistance, ColEl 

replicon 

pUC128 Ampicillin resistance, ColEl 

replicon 

pTrc99A Ampicillin resistance, ColEl 

replicon 

pT7-3 Ampicillin resistance, ColEl 

replicon 

pGPl-2 Kanamycin resistance, P15A 

replicon 

pBluescript II Ampicillin resistance, ColEl 
SK(+) replicon 

pHEl Derivative of pUC18 where a 

4 kb Sau3Al DNA fragment from 
phage EP2 was subcloned into the 
BamH l poly linker 



Norrander et al* 
Keen et al. (1988) 
Pharmacia 

Tabor & Richardson 

(1985) 
Tabor & Richardson 

(1985) 

See examples 



SUBSTITUTE SHEET 
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Table 1-2 
pHE7 

pHE5 

pHE8 



pLBl 

pLB2 
pHE12 

pBDl 
pHE21 



pHE2 

pHE16 
pBD9 
pBD6 
pHE18 

pHE14 
pMLl 



See examples 
See examples 



Derivative of pHEl where a 0.5 kb See examples 
Sph l DNA fragment was deleted 
Derivative of pHE7 where a 0.7 kb 
Kpn l DNA fragment was deleted 
Derivative of pHE5 where a 0.8 kb 
Kpn l/ Sac ~I DNA fragment was 
deleted. Cohesive ends were 
removed ;>rior to ligation by 
using the 3' exonuclease activity 
of T4 DNA polymerase 
Derivative of pT7-3 f where the 
2,7 kb insert from pHE5 was cloned 
into the Eco RI/ Hind lll polylinker 
Derivative of pLBl, where a 0.4 kb See examples 
Sac l/ Spo l fragment was deleted 

Derivative of pUC128, where a 
6.2 kb Xho l fragment from phage 
EP2 was subcloned into the Xho l 
polylinker 

Derivative of pHE12, where a 
2.0 kb Spo l/ Nsi l fragment was 
deleted 

Derivative of pTrc99A, where €he 
4.0 kb insert of pHEl was cloned 
into the EcoR I / Hind lll polylinker. 
Derivative of pHE21, where a 
1.2 kb Kpn I/ Spo l fragment was 
deleted 

Derivative of pUC18 where a 

6.0 kb St>h l DNA fragment from 
phage EP6 was cloned into the 
Sph I site in the polylinker of 
the vector 

Derivative of pHE2 where a 
1.5 kb EcoR I- Sma l DNA fragment 
was deleted 

Derivative of pBDl where a 
0.4 kb Xhol-Fsp l DNA fragment 
was deleted 

Derivative of pHE12 where a 
3.4 kb Xhol - Esp l DNA fragment 
was deleted 

Derivative of pUC128 where an 

5.1 kb Notl-PvuII DNA fragment 
from EP6 was cloned into the 
Notl-EcoRV sites in the polylinker 
of the vector 

Derivative of pUC128 where a 
3.0 kb Bal ll DNA fragment from 
EP6 was cloned into the BamHI 
site of the polylinker 
Derivative of pBluescript II 
SK( + ) where a 4.3 kb Kpnl-SacII 
DNA fragment was cloned into 
the corresponding sites in the 
polylinker 



See examples 



See examples 



See examples 



See examples 



See examples 



See examples 



See examples 
See examples 

See examples 

See examples 



See examples 



See examples 
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Table 2. Putative mannuronan C-5-epimerase activity in 
recombinant phage lysates . 



Recombinant phage 


3 H release (dpm) 


EPx 


39 


EP2 


91 


EP3 


107 


EP6 


74 


EP7 


75 


EP8 


245 


EP9 


75 



EPx originated from the A . vi ne 1 andi i gene library as a 
randomly picked plaque, while the other six phages were 
selected on the basis of the hybridization between their DNA 
and the labelled oligonucleotide used for screening of the 
library- 
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Table 3. Activity of the mannuronan C-5-epimerase 
expressed from the plasmids . 



Enzyme 

. 


Strain 


Released 3 H/OD 600 unit 
cell culture 


No IPTG 


IPTG 


Epimerase l 1 


JMlUy ipHElb ) 


10000 


110000 


Epimerase l 2 


Dno (XlpHEl ) 


273 


nd 


Epimerase I 3 


DnouipnED ) 


9700 


nd 


Epimerase l 4 


JM1UD ipHEl ; 


637 


2800 


Epimerase I 5 


JMlUb (pHE5 ) 


4800 


28500 


Epimerase l 6 


JM105 (pHE8) 


58900 


181000 


Epimerase l 7 


JM109 (pHE21) 


93 


1283 


Epimerase l 8 


JM109 (pHE22) 


5383 


34611 


Epimerase I 9 


K38(pGPl-2,pLB2)* 


2150 


8333 


Epimerase 2 1 


JMl09(pHEl2) 


nd 


140 


Epimerase 2 2 


JM109 (pBD9) 


nd 


6700 


Epimerase 3 1 


JM109 (pHEl8) 


551 


2270 


Epimerase 3 2 


JM109 (pBD6) 


nd 


530 


Epimerase 4 


DH5a(pHE14) 


nd 


3500 



The extracts were incubated with the alginate for 16 
hours, and the numbers are given in dpm. nd = not 
determined. 

* The culture was not induced by IPTG, but by raising the 
temperature from 30°C to 42°C. 
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SEQUENCE LISTING 
SEQUENCE LISTING NO: 1 

SEQUID NO:l 

SEQUENCE TYPE: Nucleotide with corresponding proteins. 
SEQUENCE LENGTH: 12411 base pairs 

STRANDEDNESS : single 

TOPOLOGY: linear 

MOLECULE TYPE: genomic DMA 

ORIGINAL SOURCE ORGANISM: Azotobacter vinelandii strain E 
FEATURES: 

from 290 to 1951 bp epimerase 4 
from 2227 to 6438 bp epimerase 1 
from 6702 to 9695 bp epimerase 2 
from 9973 to 12411 bp first part of epimerase 3 

PROPERTIES: Azotobacter vinelandii mannuronan C-5-epimerase..genes 

1 GATCCGGCCG TCTGAGACGG CGCCTCCGGC CGTCGGCGAG TGCGCCGTTC GCCGACGGCC 

61 GGGCGAACGG ATGAGGACTG CTCCACTCTC ACCCAGATAA GCGCGTGGGC CGTTTCATCC 

121 GAGCGCCTTT CCGGGCCGCT TCGAAAGACC GCCACGAGGC ACTCTGTGCA AGGGCCAGGC 

181 AGTCGCGTTG CAACCGGAGA CGGGACCGGC CCGTTCGGGC GTCGTCTCTT CCCGCTCCAC 

241 TTTTTCCAGG CAGCTTCGGC TGCTCCACTC GGAACCGGGA AGCGGAGAT 



290 


ATG 

Met- 
ric u 


GAT 
Asp 


TAC 
Tyr 


AAC 
Asn 


GTC 
Val 


AAG 
Lys 


GAT 
Asp 


TTC 
*Phe 


GGT GCA 
Gly Ala 


TTG 
Leu 


GGC 
Gly 


GAC 
Asp 


GGC 
Gly 


GTC 
Val 


AGC 
Ser 


338 


GAC 
Asp 


GAC 
Asp 


CGG 
Arg 


GCC 
Ala 


TCC 
Ser 


ATC 
He 


CAG 
Gin 


GCG 
Ala 


GCG 
Ala 


ATC 
He 


GAT 
Asp 


GCC 
Ala 


GCC 
Ala 


TAC 
Tyr 


GCC 
Ala 


GCC 
Ala 


386 


GGT 
Gly 


GGC 
Gly 


GGT 
Gly 


ACC 
Thr 


GTC 
Val 


TAC 
Tyr 


CTG 
Leu 


CCG 
Pro 


GCC GGC 
Ala Gly 


GAG 
Glu 


TAC 
Tyr 


CGG 
Arg 


GTC 
Val 


AGC 
Ser 


GCC 
Ala 


434 


GCC 
Ala 


GGG 
Gly 


GAG 
Glu 


CCG 
Pro 


GGC 
Gly 


GAC 
Asp 


GGC 
Gly 


TGC 
Cys 


CTG 
Leu 


ATG 
Met 


CTC 
Leu 


AAG 
Lys 


GAC 
Asp 


GGC 
Gly 


GTC 
Val 


TAC 
Tyr 


482 


CTG 
Leu 


GCC 
Ala 


GGT 
Gly 


GCC 
Ala 


GGC 
Gly 


ATG 
Met 


GGC 
Gly 


GAG 
Glu 


ACG 
Thr 


GTG 
Val 


ATC 
He 


AAG 
Lys 


CTG 
Leu 


ATC 
He 


GAC 
Asp 


GGC 
Gly 


530 


TCC 
Ser 


GAC 
Asp 


CAG 
Gin 


AAG 
Lys 


ATC 
lie 


ACC 
Thr 


GGC 
Gly 


ATG 
Met 


GTC 
Val 


CGC 
Arg 


TCG 
Ser 


GCC 
Ala 


TAC 
Tyr 


GGC 
Gly 


GAG 
Glu 


GAA 
Glu 


578 


ACC 
Thr 


AGC 
Ser 


AAC 
Asn 


TTC 
Phe 


GGC 
Gly 


ATG 
Met 


CGC 
Arg 


GAC 
Asp 


CTG 
Leu 


ACC 
Thr 


CTC 
Leu 


GAC 
Asp 


GGC 
Gly 


AAC 
Asn 


CGC 
Arg 


GAC 
Asp 


626 


AAC 
Asn 


ACC 
Thr 


AGC 
Ser 


GGC 
Gly 


AAG 
Lys 


GTC 
Val 


GAC 
Asp 


GGC 
Gly 


TGG 
Trp 


TTC 
Phe 


AAC 
Asn 


GGC 
Gly 


TAT 
Tyr 


ATC 
He 


CCC 
Pro 


GGC 
Gly 


674 


GGG 
Gly 


GAC 
Asp 


GGC 
Gly 


GCC 
Ala 


GAC 
Asp 


CGC 
Arg 


GAC 
Asp 


GTG 
Val 


ACC 
Thr 


ATC 
He 


GAG 
Glu 


CGG 
Arg 


GTG 
Val 


GAG 
Glu 


GTC 
Val 


CGC 
Arg 


722 


GAG 
Glu 


ATG 
Met 


TCC 
Ser 


GGC 
Gly 


TAC 
Tyr 


GGC 
Gly 


TTC 
Phe 


GAC 
Asp 


CCC 
Pro 


CAC 
His 


GAG 
Glu 


CAG 
Gin 


ACC 
Thr 


ATC 
He 


AAC 
Asn 


CTG 
Leu 


770 


ACG 
Thr 


ATC 
lie 


CGC 
Arg 


GAC 
Asp 


AGC 
Ser 


GTG 
Val 


GCC 
Ala 


CAC 
His 


GAC AAC 
Asp Asn 


GGC 
Gly 


CTC 
Leu 


GAC 
Asp 


GGC 
Gly 


TTC 
Phe 


GTC 
Val 


818 


GCC 
Ala 


GAC 
Asp 


TAC 
Tyr 


CTG 
Leu 


GTC 
Val 


GAC 
Asp 


AGC 
Ser 


GTG 
Val 


TTC 
Phe 


GAG 
Glu 


AAC 
Asn 


AAC 
Asn 


GTC 
Val 


GCC 
Ala 


TAC 
Tyr 


GCC 
Ala 


866 


AAC 
Asn 


GAC 
Asp 


CGC 
Arg 


CAC 
His 


GGC 
Gly 


TTC 
Phe 


AAC 
Asn 


GTG 
Val 


GTC 
Val 


ACC 
Thr 


AGC 
Ser 


ACC 
Thr 


CAC 
His 


GAT 
Asp 


TTC 
Phe 


GTC 
Val 


914 


ATG 
Met 


ACC 
Thr 


AAC 
Asn 


AAC 
Asn 


GTC 
Val 


GCC 
Ala 


TAC 
Tyr 


GGC 
Gly 


AAC 
Asn 


GGC 
Gly 


AGC 
Ser 


AGC 
Ser 


GGC 
Gly 


CTG 
Leu 


GTG 
Val 


GTG 
Val 
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962 


CAG 
Gin 


CGG 
Arg 


GGT 
Gly 


CTG 
Leu 


GAG 
Glu 


GAC 
Asp 


CTC 
Leu 


GCG 
Ala 


CTG 
Leu 


CCC 
Pro 


AGC 
Ser 


AAC 
Asn 


AVC 
He 


CTG 
Leu 


ATC 
He 


GAC 
Asp 


1010 


GGC 
Gly 


GGC 
Gly 


GCC 
Ala 


TAC TAC 
Tyr Tyr 


GAC 
Asp 


AAC 
Asn 


GCC 
Ala 


CGC 
Arg 


GAA 
Glu 


GGC 
Gly 


GTG 
Val 


CTG 
Leu 


CTC 
Leu 


AAG 
Lys 


ATG 
Met 


1058 


ACC 
Thr 


AGC 
Ser 


GAC 
Asp 


ATC 
He 


ACC 
Thr 


CTG 
Leu 


CAG 
Gin 


AAC 
Asn 


GCC 
Ala 


GAT 
Asp 


ATC 
He 


CAC 
His 


GGC 
Gly 


AAC 
Asn 


GGC 
Gly 


TCC 
Ser 


1106 


TCC 
Ser 


GGG 
Gly 


GTG 
Val 


CGC GTC 
Arg Val 


TAC 
Tyr 


GGC 
Gly 


GCC 
Ala 


CAG 
Gin 


GAC 
Asp 


GTG 
Val 


CAG 
Gin 


ATC 
lie 


CTC 
Leu 


GAT 
Asp 


AAC 
Asn 


1154 


CAG 
Gin 


ATC 
He 


CAC 
His 


GAC AAC 
Asp Asn 


GCG 
Ala 


CAG 
Gin 


GCG 
Ala 


GCC 
Ala 


GCC 
Ala 


GTG 
Val 


CCC 
Pro 


gXg 

Glu 


GTC 
Val 


CTG 
Leu 


CTG 
Leu 


1202 


CAG 
Gin 


TCC 
Ser 


TTC 
Phe 


GAC GAT 
Asp Asp 


ACC 
Thr 


GCC 
Ala 


GGG 
Gly 


GCG 
Ala 


TCC 
Ser 


GGC 
Gly 


ACC 
Thr 


TAC 
Tyr 


TAC 
Tyr 


ACG 
Thr 


ACC 
Thr 


1250 


CTG 
Leu 


AAC 
Asn 


ACC 
Thr 


CGG 
Arg 


ATC 
He 


GAG 
Glu 


GGC 
Gly 


AAC 
Asn 


ACC 
Thr 


ATC 
He 


AGC 
Ser 


GGC 
Gly 


TCG 
Ser 


GCC 
Ala 


AAC 
Asn 


TCC 
Ser 


1298 


ACC 
Thr 


TAC 
Tyr 


GGC 
Gly 


ATC 
He 


CAG 
Gin 


GAG 
Glu 


CGC 
Arg 


AAC 
Asn 


GAC 
Asp 


GGC 
Gly 


ACC 
Thr 


GAC 
Asp 


TAC 
Tyr 


AGC 
Ser 


AGC 
Ser 


CTG 
Leu 


1346 


ATC 
He 


GAC 
Asp 


AAC 
Asn 


GAC ATC 
Asp He 


GCC 
Ala 


GGG 
Gly 


GTG 
Val 


CAA 
Gin 


CAG 
Gin 


CCC 
Pro 


ATC 
He 


CAA 
Gin 


CTG 
Leu 


TAC 
Tyr 


GGA 
Gly 


1394 


CCT 
Pro 


CAC 
His 


TCG 
Ser 


ACG 
Thr 


GTA 
Val 


TCC 
Ser 


GGC 
Gly 


GAA 
Glu 


CCC 
Pro 


GGC 
Gly 


GCG 
Ala 


ACA 
Thr 


CCG 
Pro 


CAA 
Gin 


CAG 
Gin 


CCG 
Pro 


1442 


TCC 
Ser 


ACG 
Thr 


GGA 
Gly 


AGC 
Ser 


GAC 
Asp 


GGC 
Gly 


GAG 
Glu 


CCA 
Pro 


CTG 
Leu 


GTC 
Val 


GGC 
Gly 


GGC 
Gly 


GAC 
Asp 


ACG 
Thr 


GAC 
Asp 


GAC 
Asp 


1490 


CAG 
Gin 


CTC 
Leu 


CAG 
Gin 


GGC 
Gly 


GGC 
Gly 


TCC 
Ser 


GGC 
Gly 


GCC 
Ala 


GAT 
Asp 


CGC 
Arg 


CTG 
Leu 


GAC 
Asp 


GGC 
Gly 


GGG 
Gly 


GCC 
Ala 


GGC 
Gly 


1538 


GAC 
Asp 


GAC 
Asp 


ATC 
He 


CTC 
Leu 


GAC 
Asp 


GGC 
Gly 


GGC 
Gly 


GCC 
Ala 


GGG 
Gly 


CGC 
Arg 


GAC 
Asp 


CGG 
Arg 


CTG 
Leu 


AGC 
Ser 


GGC 
Gly 


GGC 
Gly 


1586 


GCG 
Ala 


GGC 
Gly 


GCC 
Ala 


GAC 
Asp 


ACC 
Thr 


TTC 
Phe 


GTG 
Val 


TTC 
Phe 


TCC 
Ser 


GCC 
Ala 


CGC 
Arg 


GAG 
Glu 


GAC 
Asp 


AGC 
Ser 


TAC 
Tyr 


CGT 
Arg 


1634 


ACC 
Thr 


GAC 
Asp 


ACG 
Thr 


GCG 
Ala 


GTG 
Val 


TTC 
Phe 


AAC 
Asn 


GAC 
Asp 


CTG 
Leu 


ATC 
He 


CTC 
Leu 


GAC 
Asp 


TTC 
Phe 


GAG 
Glu 


GCC 
Ala 


AGC 
Ser 


1682 


GAG 
Glu 


GAT 
Asp 


CGC 
Arg 


ATC 
He 


GAC 
Asp 


CTG 
Leu 


TCC 
Ser 


GCG 
Ala 


CTG 
Leu 


GGC 
Gly 


TTT 
Phe 


TCC 
Ser 


GGC 
Gly 


CTG 
Leu 


GGC 
Gly 


GAC 
Asp 


1730 


GGC 
Gly 


TAT 
Tyr 


GGC 
Gly 


GGC 
Gly 


ACC 
Thr 


CTG 
Leu 


CTC 
Leu 


CTG 
Leu 


AAG 
Lys 


ACC 
Thr 


AAC 
Asn 


GCC 
Ala 


GAG 
Glu 


GGC 
Gly 


ACG 
Thr 


CGC 
Arg 


1778 


nv.v» 

Thr 


Tyr 


Leu 


AAA 
Lys 


AGC 
Ser 


TTY"* 
I IV. 

Phe 


Glu 


Ala 


GAl 

Asp 


Ala 


GAG 
Glu 


GGA 
Gly 


CGG 
Arg 


CGC 
Arg 


TTC 
Phe 


GAG 
Glu 


1826 


GTC 
Val 


GCC 
Ala 


CTG 
Leu 


GAC 
Asp 


GGC 
Gly 


GAC 
Asp 


CAC 
His 


ACG 
Thr 


GGC 
Gly 


GAT 
Asp 


CTT 
Leu 


TCC 
Ser 


GCC 
Ala 


GCC 
Ala 


AAT 
Asn 


GTG 
Val 


1874 


GTC 
Val 


TTC 
Phe 


GCC 
Ala 


GCG 
Ala 


ACC 
Thr 


GGG 
Gly 


ACG 
Thr 


ACC 
Thr 


ACC 
Thr 


GAA 
Glu 


CTC 
Leu 


GAA 
Glu 


GTV 
Va. 


CTC 
Leu 


GGC 
Gly 


GAC 
Asp 


1922 


AGC 
Ser 


GGC 
Gly 


ACG 
Thr 


CAG 
Gin 


GCC 
Ala 


GGG 
Gly 


GCG 
Ala 


ATC 
He 


GTC 
Val 


TAG 


CGCGTCCCGC TCCGACACAT 



1972 AGCCGGTCGT CGGCAAGGCG GCCGGCCGCC GGCTGCCCCG AAGTTTCCAA TCTAATCTCA 

2032 CCTACAGACA GGCGCGTTCC GGTGCGCCCG AGCGCCGCCC CCGGGAACGA CCGGCAGGGC 

2092 GTGTTTGTGC GCAAGGTGCA GGCGGTCGCG CTCGAAGCCA GAGGCAGGGA AAACCTTTTC 

2152 CGGCAGTCGT CTCTTCCTTC TCCACTTCCC AGGCAGCCCT GGGCCGAGCA ACACGACGGG 

2212 ATTAGGAAGC GGATC ATG GAT TAC AAC GTC AAG GAT TTC GGA GCA 
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Met Asp Tyr Asn Val Lys Asp Phe Gly Ala 



2257 


CTG GGC GAT GGC GTC 
Leu Gly Asp Gly Val 


AGC 
Ser 


GAC 
Asp 


GAC 
Asp 


ACG 
Thr 


GCG 
Ala 


GCC 
Ala 


ATC 
He 


CAG 
Gin 


GCG 
Ala 


GCG 
Ala 


ATC 
He 


2305 


GAC GCC GCC CAC GCG 
Asp Ala Ala His Ala 


GCG 
Ala 


GGC 
Gly 


GGC 
Gly 


GGC 
Gly 


ACC 
Thr 


GTC 
Val 


TAC 
Tyr 


CTG 
Leu 


CCG 
Pro 


GCC 
Ala 


GGC 
Gly 


2353 


GAA TAT CGG GTC AGC 
Glu Tyr Arg Val Ser 


GGC 
Gly 


GGC 
Gly 


GAG 
Glu 


GAG 
Glu 


CCT 
Pro 


TCC 
Ser 


GAT 
Asp 


GGT 
Gly 


TGT 
Cys 


CTG 
Leu 


ACC 
Thr 


2401 


ATC AAG 
He Lys 


AGC 
Ser 


AAC 
Asn 


GTC 
Val 


CAT 
His 


ATC 
He 


GTC 
Val 


GGC 
Gly 


GCC 
Ala 


GGG 
Gly 


ATG 
Met 


GGC 
Gly 


GAG 
Glu 


ACG 
Thr 


GTG 
Val 


2449 


ATC AAG ATG GTC GAC 
He Lys Met Val Asp 


GGC 
Gly 


TGG 
Trp 


ACG 
Thr 


CAG 
Gin 


AAC 
Asn 


GTC 
Val 


ACC 
Thr 


GGC 
Gly 


ATG 
Met 


GTG 
Val 


CGC 
Arg 


2497 


TCG GCC TAC GGC GAG 
Ser Ala Tyr Gly Glu 


GAA 
Glu 


ACC 
Thr 


AGC 
Ser 


AAC 
Asn 


TTC 
Phe 


GGC 
Gly 


ATG 
Met 


AGC 
Ser 


GAC 
Asp 


CTG 
Leu 


ACC 
Thr 


2545 


CTC GAC GGC AAC CGC GAC AAC CTG TCC GCC AAG GTC GAC GGC TGGT-TTC 
Leu Asp Gly Asn Arg Asp Asn Leu Ser Ala Lys Val Asp Gly Trp Phe 


2593 


AAC 
Asn 


GGC 
Gly 


TAC 
Tyr 


ATC 
He 


CCC 
Pro 


GGC 
Gly 


CAG 
Gin 


GAC 
Asp 


GGC 
Gly 


GCC 
Ala 


GAT 
Asp 


CGC 


GAC GTG 
Asp Val 


ACC 
Thr 


CTG 
Leu 


2641 


GAG 
Glu 


CGG 
Arg 


GTG 
Val 


GAA 
Glu 


ATC 
He 


CGC 
Arg 


GAG 
Glu 


ATG 
Met 


TCC 
Ser 


GGC 
Gly 


TAC 
Tvr 


GGT 
Glv 


TTC GAC 
Phe Asp 


CCC 
Pro 


CAC 
His 


2689 


GAG 
Glu 


CAG 
Gin 


ACC 
Thr 


ATC 
He 


AAC 
Asn 


CTG 
Leu 


ACG 
Thr 


ATC 
He 


CGC 
Arg 


GAC 
Asp 


AGC 
Ser 


GTG 
Val 


GCC 
Ala 


CAC 
His 


GAC 
Asd 


AAC 
Asn 


2737 


AGC 
Ser 


CTC 
Leu 


GAC 
Asp 


GGC 
Gly 


TTC 
Phe 


GTC 
Val 


GCC 
Ala 


GAC 
Asp 


TAC 
Tyr 


CAG 
Gin 


GTC 
Val 


GGC 
Gly 


GGG 
Gly 


GTG 
Val 


TTC 
Phe 


GAG 
Glu 


2785 


AAC 
Asn 


AAC 
Asn 


GTC 
Val 


TCG 
Ser 


TAC 
Tyr 


AAC 
Asn 


AAC 

Asn 


Asp 


vbv 

Arg 


LAC 

His 


GGC 

Gly 


TTC 
Phe 


AAC 
Asn 


ATC 
He 


GTC 
Val 


ACC 
Thr 


2833 


AGC 
Ser 


ACC 
Thr 


AAC 
Asn 


GAC 
Asp 


TTC 
Phe 


GTC 
Val 


CTG 
Leu 


AGC 
Ser 


AAC 
Asn 


AAC 
Asn 


GTC 
Val 


GCC 
Ala 


TAC 
Tyr 


GGC 
Gly 


AAC 
Asn 


GGC 
Gly 


2881 


GGC 
Gly 


GCC 
Ala 


GGC 
Gly 


CTG 
Leu 


GTG 
Val 


GTG 
Val 


CAG 
Gin 


CGC 
Arg 


GGC 
Gly 


TCG 
Ser 


TAC 
Tyr 


GAC 
Asp 


CTG 
Leu 


CCC 
Pro 


CAT 
His 


CCC 
Pro 


2929 


TAC 
Tyr 


GAC 
Asp 


ATC 
He 


CTG 
Leu 


ATC 
He 


GAC 
Asp 


GGC 
Gly 


GGC 
Gly 


GCC 
Ala 


TAC 
Tyr 


TAC 
Tyr 


GAC 
Asp 


AAC 
Asn 


GCC 
Ala 


TTG 
Leu 


GAA 
Glu 


2977 


GGC 
Gly 


GTG 
Val 


CAG 
Gin 


CTC 
Leu 


AAG 
Lys 


ATG 
Met 


GCC 
Ala 


CAC 
His 


GAC 
Asp 


GTC 
Val 


ACC 
Thr 


CTG 
Leu 


CAG 
Gin 


AAC 
Asn 


GCC 
Ala 


GAG 
Glu 


3025 


ATC 
He 


TAC 
Tyr 


GGC 
Gly 


AAC 
Asn 


GGC 
Gly 


CTG 
Leu 


TAC 
Tyr 


GGG 
Gly 


GTG 
Val 


CGC 
Arg 


GTC 
Val 


TAC 
Tyr 


GGC 
Gly 


GCC 
Ala 


CAG 
Gin 


GAC 
Asp 


3073 


GTG 
Val 


CAG 
Gin 


ATC 
He 


CTC 
Leu 


GAC 
Asp 


AAC 
Asn 


CAG 
Gin 


ATC 
He 


CAC 
His 


GAC 
Asp 


AAT 
Asn 


TCG 
Ser 


CAG 
Gin 


AAC 
Asn 


GGC 
Gly 


GCC 
Ala 


3121 


TAT 
Tyr 


GCC 
Ala 


GAA 
Glu 


GTC 
Val 


CTG 
Leu 


CTG 
Leu 


CAC" 
Gil 


TCC 
Ser 


TAC 
Tyr 


GAC 
Asp 


GAC 
Asp 


ACC 
Thr 


GCC 
Ala 


GGG 
Gly 


GTG 
Val 


TCC 
Ser 


3169 


GGC 
Gly 


AAC 
Asn 


TTT 
Phe 


TAC 
Tyr 


GTC 
Val 


ACC 
Thr 


ACC 
Thr 


GGC 
Gly 


ACC 
Thr 


TGG 
Trp 


CTC 
Leu 


GAA 
Glu 


GGC 
Gly 


AAC 
Asn 


GTC 
Val 


ATC 
He 


3217 


AGC 
Ser 


GGC 
Gly 


TCG 
Ser 


GCC 
Ala 


AAT 
Asn 


TCC 
Ser 


ACC 
Thr 


TAC 
Tyr 


GGC 
Gly 


ATC 
He 


CAG 
Gin 


GAG 
Glu 


CGC 
Arg 


GCC 
Ala 


GAC 
Asp 


GGC 
Gly 


3265 


ACC 
Thr 


GAC 
Asp 


TAC 
Tyr 


AGC 
Ser 


AGC 
Ser 


CTC 
Leu 


TAC 
Tyr 


GCC 
Ala 


AAC 
Asn 


AGC 
Ser 


ATC 
He 


GAC 
Asp 


GGT 
Gly 


GTG 
Val 


CAG 
Gin 


ACC 
Thr 


3313 


GGG 


GCG 


GTA 


CGG 


CTG 


TAT 


GGC 


GCC 


AAC 


TCG 


ACG 


GTT 


TCC 


AGC 


CAG 


TCC 
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Gly 


Ala Val 


Arg 


Leu 


Tyr 


Gly 


3361 


GGC 
Gly 


AGT GGC 
Ser Gly 


CAG 
Gin 


CAG 
Gin 


Ala 


Thr 


3409 


CTG 
Leu 


AGC GGG 
Ser Gly 


ACC 
Thr 


GAG 
Glu 


GCC 
Ala 


cue 
His 


3457 


GAC 
Asp 


CGC CTG 
Arg Leu 


AAC 
Asn 


GGC 
G3y 


GAT 
Asp 


Ala 


3505 


GGG 
Gly 


CGC GAC 
Arg Asp 


AAC 
Asn 


CTG 
Leu 


ACC 
Thr 


GGC 
Gly 


3553 


TCC 
Ser 


GCG CGC 
Ala Arg 


ACC 
Thr 


GAC 
Asp 


AGC 
Ser 


TAP 

Tyr 


3601 


CTG 
Leu 


ATC ACC 
He Thr 


GAC 
Asp 


TTC 
Phe 


GAC 
Asp 


GCC 
Ala 


3649 


CTG 
Leu 


GGC TTC 
Gly Phe 


ACC 
Thr 


GGC 
Gly 


CTG 
Leu 


GGC 
Gly 


3697 


AAG 
Lys 


ACC AAC 
Thr Asn 


GCC 
Ala 


GAG 
Glu 


GGT 
Gly 


ACG 
Thr 


3745 


GAC 
Asp 


GCC CAG 
Ala Gin 


GGC 
Gly 


CGG 
Arg 


CGC 
Arg 


TTC 

Phe 


3793 


GGT 
Gly 


CTG TTC 
Leu Phe 


AAC 
Asn 


GAC 
Asp 


Asn 


Asn 


3841 


GGT 
Gly 


ACC GAG 
Thr Glu 


GGC 
Gly 


AGC 
Ser 


Gar* 
Asp 


Asn 


3889 


CTC 
Leu 


CTG GGC 
Leu Gly 


TAC 
Tyr 


Gly 


Gly 


Asn 


3937 


GAC 
Asp 


ATC CTC 
He Leu 


GTC 
Val 


GGC 
Gly 


Gly 


Ala 


3985 


GGG 
Gly 


GCG GAC 
Ala Asp 


GTG 
Val 


TTC 
Phe 


CGC 

Arg 


TTC 

Phe 


4033 


TAC 
Tyr 


ACC ACC 
Thr Thr 


GGC 
Gly 


GAC 
Asp 


AAC 
Asn 


^AG 

Gin 


4081 


ACC 
Thr 


CTG GAC 
Leu Asp 


AGG 
Arg 


ATC 
He 


Asp 


Val 


4129 


AAC 
Asn 


GGC CGC 
Gly Arg 


AAC 
Asn 


GGC 
Gly 


ACC 
Thr 


CTC 
Leu 


4177 


CGC 
Arg 


ACC GAT 
Thr Asp 


CTG 
Leu 


AAG 
Lys 


AGC 
Ser 


TAC 

Tyr 


4225 


GAG 
Glu 


CTT TCC 
Leu Ser 


CTC 
Leu 


GCG 
Ala 


GGT 

Gly 


AAC 

Asn 


4273 


TTC 
Phe 


GTT TTC 
Val Phe 


GCG 
Ala 


ACG 
Thr 


TCT 
Ser 


CAG 
Gin 


4321 


ACC 
Thr 


GAC GGC 
Asp Gly 


AAC 
Asn 


GAT 
Asp 


ACC 
Thr 


TTG 
Leu 


4369 


CTC 
Leu 


GGC CTG 
Gly Leu 


GAC 
Asp 


GGC 
Gly 


CGG 
Arg 


GAC 
Asp 


4417 


ATC 


CTC GAC 


GGC 


GGA 


GCG 


GGG 



37 



Ala Asn Ser Thr Val Ser Ser Gin Ser 

CTC GAA GGC AGC GCG GGC AAC GAT GCG 
Leu Glu Gly Ser Ala Gly Asn Asp Ala 

GAG ACG CTG CTC GGC CAG GCC GGC GAC 
Glu Thr Leu Leu Gly Gin Ala Gly Asp 

GGC AAC GAC ATC CTC GAC GGC GGG GCA 
Gly Asn Asp He Leu Asp Gly Gly Ala 

GGC GCG GGC GCC GAC ACC TTC CGC TTC 
Gly Ala Gly Ala Asp Thr Phe Arg Phe 

CGC ACC GAC AGC GCC AGC TTC AAC GAC 
Arg Thr Asp Ser Ala Ser Phe Asn Asp 

GAC GAG GAC AGC ATC GAC CTG TCC GCG 
Asp Glu Asp Ser He Asp Leu Ser Ala 

GAC GGC TAC AAT GGC ACC CTG CTG CTG 
Asp Gly Tyr Asn Gly Thr Leu Leu Leu 

CGC ACC TAC CTG AAG AGC TAC GAA GCG 
Arg Thr Tyr Leu Lys Ser Tyr Glu Ala 

GAG ATC GCC CTG GAC GGC AAC TTC ACC 
Glu He Ala Leu Asp Gly Asn Phe Thr 

CTG TTG TTC GAC GCC GCT CCG GCC ACC 
L.eu Leu Phe Asp Ala Ala Pro Ala Thr 

CTG CTC GGC ACC GAC GCC GGG GAA ACC 
Leu Leu Gly Thr Asp Ala Gly Glu Thr 

GAC ACC CTC AAC GGC GGG GCC GGC GAC 
Asp Thr Leu Asn Gly Gly Ala Gly Asp 

GGG CGC GAC AGC CTG ACC GGC GGC GCC 
Gly Arg Asp Ser Leu Thr Gly Gly Ala 

GAC GCG CTG TCC GAC AGC CAG CGC AAC 
Asp Ala Leu Ser Asp Ser Gin Arg Asn 

GCC GAC CGC ATT CTC GAC TTC GAC CCG 
Ala Asp Arg He Leu Asp Phe Asp Pro 

TCG GCG CTG GGC TTC ACC GGG CTG GGC 
Ser Ala Leu Gly Phe Thr Gly Leu Gly 

GCC GTG GTG CTC AAC AGC GCC GGC GAC 
Ala Val Val Leu Asn Ser Ala Gly Asp 

GAC ACC GAC GCC AAC GGC TAC AGC TTC 
Asp Thr Asp Ala Asn Gly Tyr Ser Phe 

TAC CAG GGG CAG CTC AGC GCC GAG CAG 
Tyr Gin Gly Gin Leu Ser Ala 31u Gin 

GGG GGA CAG ATG ACG ATT ATC 3AA GGC 
Gly Gly Gin Met Thr He He Glu Gly 

CAG GGC ACC GAG GCC AAC GAG CGG CTC 
Gin Gly Thr Glu Ala Asn Glu Arg Leu 

AAC CTG AAC GGC GGC GCC GGC GAC GAC 
Asn Leu Asn Gly Gly Ala Gly Asp Asp 

CGC GAC ACC CTG ACC GGC GGC ACG GGG 
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lie Leu Asp Gly Gly Ala Gly Arg Asp Thr Leu Thr Gly Gly Thr Gly 

44 65 GCC GAC ACC TTC CTG TTC TCC ACG CGT ACC GAC AGC TAC CGC ACC GAC 
Ala Asp Thr Phe Leu Phe Ser Thr Arg Thr Asp Ser Tyr Arg Thr Asp 

4513 AGC GCC AGC TTC AAC GAC CTG ATC ACC GAC TTC GAT CCC ACC CAG GAC 
Ser Ala Ser Phe Asn Asp Leu He Thr Asp Phe Asp Pro Thr Gin Asp 

4561 CGC ATC GAC CTG TCC GGC CTG GGC TTC AGC GGT TT: GGC AAC GGC TAC 
Arg He Asp Leu Ser Gly Leu Gly Phe Ser Gly Ph ; Gly Asn Gly Tyr 

4609 GAC GGC ACC CTG CTG CTG CAG GTC AAC GCC GCG GGC ACC CGC ACC TAC 
Asp Gly Thr Leu Leu Leu Gin Val Asn Ala Ala Gly Thr Arg Thr Tyr 

4657 CTG AAG AGT TTC GAG GCC GAT GCC AAC GGC CAG CGC TTC GAG ATC GCC 
Leu Lys Ser Phe Glu Ala Asp Ala Asn Gly Gin Arg Phe Glu He Ala 

4705 CTG GAC GGC GAC TTC AGC GGC CAA TTG GAC AGC GGC AAC GTG ATC TTC 
Leu Asp Gly Asp Phe Ser Gly Gin Leu Asp Ser Gly Asn Val He Phe 

4753 GAG CCC GCC GTG TTC AAT GCC AAG GAC TTC GGC GCG CTG GGC GAC GGC 
Glu Pro Ala Val Phe Asn Ala Lys Asp Phe Gly Ala Leu Gly Asp Gly 

4801 GCC AGC GAC GAC CGG CCG GCC ATC CAG GCG GCG ATC GAC GCC GCC TAC 
Ala Ser Asp Asp Arg Pro Ala He Gin Ala Ala He Asp Ala Ala Tyr 

4849 GCG GCC GGT GGC GGC ACC GTC TAC CTG CCG GCC GGC GAG TAC CGG GTC 
Ala Ala Gly Gly Gly Thr Val Tyr Leu Pro Ala Gly Glu Tyr Arg Val 

4897 AGC CCC ACC GGG GAG CCG GGC GAC GGC TGC CTG ATG CTC AAG GAC GGC 
Ser Pro Thr Gly Glu Pro Gly Asp Gly Cys Leu Met Leu Lys Asp Gly 

4945 GTC TAC CTG GCC GGC GAC GGC ATA GGC GAA ACG GTC ATC AAG CTG ATC 
Val Tyr Leu Ala Gly Asp Gly He Gly Glu Thr Val He Lys Leu He 

4993 GAC GGC TCC GAC CAG AAG ATC ACC GGC ATG GTC CGC TCG GCC TAT GGC 
Asp Gly Ser Asp Gin Lys He Thr Gly Met Val Arg Ser Ala Tyr Gly 

5041 GAA GAG ACC AGC AAC TTC GGC ATG AGC GAC CTG ACC CTC GAC GGC AAC 
Glu Glu Thr Ser Asn Phe Gly Met Ser Asp Leu Thr Leu Asp Gly Asn 

5089 CGC GAC AAC ACC AGC GGC AAG GTC GAC GGC TGG TTC AAC GGC TAC ATC 
Arg Asp Asn Thr Ser Gly Lys Val Asp Gly Trp Phe Asn Gly Tyr He 

5137 CCC GGC CAG GAC GGC GCC GAC CGC AAC GTG ACC ATC GAG CGG GTG GAA 
Pro Gly Gin Asp Gly Ala Asp Arg Asn Val Thr He Glu Arg Val Glu 

5185 ATC CGC GAG ATG TCC GGC TAT GGC TTC GAT CCG CAC GAG CAG ACC ATC 
He Arg Glu Met Ser Gly Tyr Gly Phe Asp Pro His Glu Gin Thr He 

5233 AAC CTG ACG ATC CGC GAC AGC GTG GCC CAC GAC AAC GGC CTC GAC GGC 
Asn Leu Thr He Arg Asp Ser Val Ala His Asp Asn Gly Leu Asp Gly 

5281 TTC GTC GCC GAC TAC CTG GTC GAC AGC GTG TTC GAG AAC AAC GTC GCC 
Phe Val Ala Asp Tyr Leu Val Asp Ser Val Phe Glu Asn Asn Val Ala 

5329 TAC AAC AAC GAC CGC CAC GGC TTC AAC ATC GTC ACC AGC ACC TAC GAT 
Tyr Asn Asn Asp Arg His Gly Phe *sn He Val Thr Ser Thr Tyr Asp 

5377 TTC GTC ATG ACC AAC AAC GTC GCC TAC GGC AAC GGC GGC GCC GGC CTG 
Phe Val Met Thr Asn Asn Val Ala Tyr Gly Asn Gly Gly Ala Gly Leu 

5425 ACG ATC CAG CGG GGC TCG GAG GAC CTG GCC CAG CCG ACC GAT ATC CTC 
Thr He Gin Arg Gly Ser Glu Asp Leu Ala Gin Pro Thr Asp He Leu 

5473 ATC GAC GGC GGC GCC TAC TAC GAC AAC GCC CTG GAA GGC GTC CTC TTC 
He Asp Gly Gly Ala Tyr Tyr Asp Asn Ala Leu Glu Gly Val Leu Phe 

5521 AAG ATC ACC AAC AAC GTC ACC CTG CAG AAC GCC GAG ATC TAC GGC AAC 
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Lys Met Thr Asn Asn Val Thr Leu Gin Asn Ala Glu He Tyr Gly Asn 

5569 GGC TCC TCC GGC GTG CGC CTG TAC GGC ACG GAG GAC GTG CAG ATC CTC 
Gly Ser Ser Gly Val Arg Leu Tyr Giy Thr Glu Asp Val Gin He Leu 

5617 GAC AAC CAG ATC CAC GAC AAT TCG CAG AAC GGC ACC TAT CCG GAA GTC 
Asp Asn Gin He His Asp Asn Ser Gin Asn Gly Thr Tyr Pro Glu Val 

5665 CTG CTG CAG GCC TTC GAC GAC AGC CAG GTC ACC GGT GAG CTG TAC GAG 
Leu Leu Gin Ala Phe Asp Asp Ser Gin Val Thr Gly Glu Leu Tyr Glu 

5713 ACC CTG AAC ACC CGG ATC GAA GGC AAT CTC ATC GAC GCT TCG GAC AAC 
Thr Leu Asn Thr Arg He Glu Gly Asn Leu He Asp Ala Ser Asp Asn 

5761 GCC AAC TAT GCG GTG CGC GAG CGC GAC GAC GGC AGC GAC TAC ACC ACG 
Ala Asn Tyr Ala Val Arg Glu Arg Asp Asp Gly Ser Asp Tyr Thr Thr 

5809 CTC GTG GAC AAC GAC ATC AGC GGC GGC CAG GTC GCC TCG GTG CAG CTT 
Leu Val Asp Asn Asp He Ser Gly Gly Gin Val Ala Ser Val Gin Leu 

5857 TCC GGC GCC CAT TCG AGT CTT TCC GGC GGC ACC GTC GAA GTG CCG CAG 
Ser Gly Ala His Ser Ser Leu Ser Gly Gly Thr Val Glu Val Pro Gin 

5905 GGG ACC GAC GGC AAC GAC GTG CTG GTC GGC AGC GAT GCC AAC GAC CAG 
Gly Thr Asp Gly Asn Asp Val Leu Val Gly Ser Asp Ala Asn Asp Gin 

5953 CTC TAC GGC GGA GCC GGC GAC GAC CGC CTG GAC GGC GGC GCC GGT GAC 
Leu Tyr Gly Gly Ala Gly Asp Asp Arg Leu Asp Gly Gly Ala Gly Asp 

6001 GAC CTG CTC GAC GGC GGA GCG GGG CGC GAC GAC CTG ACC GGC GGC ACG 
Asp Leu Leu Asp Gly Gly Ala Gly Arg Asp Asp Leu Thr Gly Gly Thr 

6049 GGT GCC GAC ACC TTC GTG TTC GCC GCG CGT ACC GAT AGC TAC CGC ACC 
Gly Ala Asp Thr Phe Val Phe Ala Ala Arg Thr Asp Ser Tyr Arg Thr 

6097 GAC GCG GGG GTG TTC AAC GAC CTG ATC CTC GAC TTC GAC GCC AGC GAG 
Asp Ala Gly Val Phe Asn Asp Leu He Leu Asp Phe Asp Ala Ser Glu 

6145 GAC CGC ATC GAC CTG TCC GCC CTG GGT TTC AGC GGC TTC GGC GAC GGC 
Asp Arg He Asp Leu Ser Ala Leu Gly Phe Ser Gly Phe Gly Asp Gly 

6193 TAC AAC GGC ACC CTG CTG GTG CAG CTC AGC AGC GCC GGA ACC CGT ACC 
Tyr Asn Gly Thr Leu Leu Val Gin Leu Ser Ser Ala Gly Thr Arg Thr 

6241 TAC CTC AAG AGC TAC GAG GAG GAC CTC GAG GGC CGG CGC TTC GAG GTC 
Tyr Leu Lys Ser Tyr Glu Glu Asp Leu Glu Gly Arg Arg Phe Glu Val 

6289 GCC CTG GAC GGC GAC CAC ACG GGC GAT CTT TCC GCC GCC AAT GTG GTT 
Ala Leu Asp Gly Asp His Thr Gly Asp Leu Ser Ala Ala Asn Val Val 

6337 TTC GCC GAC GAC GGC TCG GCC GCC GTG GCG AGC AGC GAT CCC GCC GCC 
Phe Ala Asp Asp Gly Ser Ala Ala Val Ala Ser Ser Asp Pro Ala Ala 

6385 ACA CAG TTG GAG GTG GTC GGC AGC AGC GGC ACC CAG ACC GAT CAA CTC 
Thr Gin Leu Glu Val Val Gly Ser Ser Gly Thr Gin Thr Asp Gin Leu 

6433 GCC TGA . TCCGACCCCG CCCATACCCG CCCGGCCATT CCGGCCGGGC GAACCAATGG 
Ala 

6489 TCTTCAGHCC AGTCTCAGGC ACAGCAGCGC GCGAGCCGCT TCGCTTTGTC CGCCCCCCGC 
6549 TTTTCTCGCT GAACGCGACG ATCGCCGGGC GCCGGGGAAG GGTTCGCCGC ATGCCGAGCC 
6609 GGGGACGGGA AAAGCCTGTT CGACCAGTCG ACTCTTCCTC CCTTCACTTT CCAGGCAGCC 

6669 TGCGGGCTGC GCAGTAACGG AACAGGAAGC AGC ATG GAT TAC AAC GTC AAA 

Met Asp Tyr Asn Val Lys 

6720 GAT TTC GGG GCG CTG GGC GAT GGC GTC AGC GAC GAT ACG GCC GCC ATC 
Asp Phe Gly Ala Leu Gly Asp Gly Val Ser Asp Asp Thr Ala Ala He 
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6768 


CAG 
Gin 


GCG 
Ala 


GCG 
Ala 


ATC 
He 


GAT 
Asp 


GCC 
Ala 


6816 


CTG 
Leu 


CCG 
Pro 


GCC 
Ala 


GGC 
Gly 


GAA 
Glu 


TAC 
Tyr 


6864 


GGT 
Gly 


TGC 
Cys 


CTG 
Leu 


ACC 
Thr 


ATC 
He 


AAG 
Lys 


6912 


GGC 
Gly 


GAG 
Glu 


ACG 
Thr 


GTC 
Val 


ATC 
He 


AAG 
Lys 


6960 


GGC 
Gly 


ATC 
He 


GTC 
Val 


CGC 
Arg 


TCG 
Ser 


GCC 
Ala 


7008 


AGC 
Ser 


GAC 
Asp 


CTG 
Leu 


ACC 
Thr 


CTC 
Leu 


GAC 
Asp 


7056 


GAC 
Asp 


GGC 
Gly 


TGG 
Trp 


TTC 
Phe 


AAC 
Asn 


GGC 
Gly 


7104 


GAC 
Asp 


GTG 
Val 


ACC 
Thr 


CTG 
Leu 


GAG 
Glu 


CGG 
Arg 


7152 


TTC 
Phe 


GAT 
Asp 


CCG 
Pro 


CAC 
His 


GAG 
Glu 


CAG 
Gin 


7200 


GCC 
Ala 


CAC 
His 


GAC 
Asp 


AAC 
Asn 


GGC 
Gly 


CTC 
Leu 


7248 


GGG 
Gly 


GTG 
Val 


TTC 
Phe 


GAG 
Glu 


AAC 
Asn 


AAC 
Asn 


7296 


AAC 
Asn 


ATC 
He 


GTC 
Val 


ACC 
Thr 


AGC 
Ser 


ACC 
Thr 


7344 


TAC 
Tyr 


GGC 
Gly 


AAC 
Asn 


GGC 
Gly 


GGC 
Gly 


GCC 
Ala 


7392 


GTG 
Val 


GCG 
Ala 


CAC 
His 


CCC 
Pro 


TAC 
Tyr 


GAC 
Asp 


7440 


AAC 
Asn 


GGC 
Gly 


CTG 
Leu 


GAA 
Glu 


GGC 
Gly 


GTG 
Val 


7488 


CAG 
Gin 


AAC 
Asn 


GCC 
Ala 


GAG 
Glu 


ATC 
He 


TAC 
Tyr 


7536 


GGC 
Gly 


GCC 
Ala 


GAG 
Glu 


GAT 
Asp 


GTG 
Val 


CAG 
Gin 


7584 


CAG 
Gin 


AAC 
Asn 


GGT 
Gly 


TCC 
Ser 


TAC 
Tyr 


GCG 
Ala 


7632 


GCC 
Ala 


GGG 
Gly 


GTG 
Val 


TCC 
Ser 


GGC 
Gly 


AAT 
Asn 


/DOU 


ppp 
Gly 


A AP 

Asn 


app 
Thr 


He 


Val 


GGL. 

Gly 


7728 


CGC 
Arg 


GAC 
Asp 


GAC 
Asp 


GGC 
Gly 


ACC 
Thr 


GAC 
Asp 


7776 


AAT 
Asn 


GTG 
Val 


CAG 
Gin 


AAC 
Asn 


GGC 
Gly 


TCG 
Ser 


7824 


TCC 
Ser 


GAC 
Asp 


CTG 
Leu 


CCC 
Pro 


GGC 
Gly 


ACC 
Thr 
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40 



GCC 
Ala 


TAC 
Tyr 


GCG 
Ala 


GCC 
Ala 


GGC 
Gly 


GGC 
Gly 


GGC 
Gly 


ACC 
Thr 


GTC 
Val 


TAC 
Tyr 


CGG 
Arg 


GTC 
Val 


AGC 
Ser 


GGC 
Gly 


GGC 
Gly 


GAG 
Glu 


GAG 
Glu 


CCT 
Pro 


TCC 
Ser 


GAT 
Asp 


AGC 
Ser 


AAC 
Asn 


GTC 
Val 


CAT 
His 


ATC 
He 


GTC 
Val 


GGC 
Gly 


GCG 
Ala 


GGG 
Gly 


ATG 
Met 


CTG 
Leu 


GTC 
Val 


GAC 
Asp 


GGC 
Gly 


TGG 
Trp 


GAT 
Asp 


CAG 
Gin 


GAC 
Asp 


GTC 
Val 


ACC 
Thr 


TAC 
Tyr 


GGC 
Gly 


GAG 
Glu 


GAG 
Glu 


ACC 
Thr 


AGC 
Ser 


AAfC 
Asn 


TTC 
Phe 


GGC 
Gly 


ATG 
Met 


GGC 
Gly 


AAC 
Asn 


CGC 
Arg 


GAC 
Asp 


AAC 
Asn 


ACC 
Thr 


AGC 
Ser 


GGC 
Gly 


AAG 
Lys 


GTC 
Val 


TAC 
Tyr 


ATT 
He 


CCC 
Pro 


GGC 
Gly 


GAG 
Glu 


GAP 
vans* 

Asp 


GGC 
Gly 


GPP 

Ala 


GAP 

Asp 


ppp 
Arg 


GTG 
Val 


GAA 
Glu 


ATC 
He 


CGT 

Arg 


Glu 


ATP 

Met 


TPP 

Ser 


ppt 

Gly 


TAP 

Tyr 


GGT 
Gly 


Thr 


ATP 

He 


a ap 
Asn 


Leu 


Thr 


AxC 

He 


CGC 

Arg 


GAC 

Asp 


AGC 

Ser 


GTG 

Val 


GAC 
Asp 


GGC 
Gly 


TTC 
Phe 


GTC 
Val 


GPP 

Ala 


GAT 

Asp 


TTP 
X X V- 

Phe 


PAP 

Gin 


ATP 

He 


ppp 
Gly 


GTC 
Val 


TCG 
Ser 


TAC 
Tyr 


AAC 
Asn 


Asn 


GAP 

Asp 


PGP 

Arg 


PAP 

His 


ppp 
Gly 


TTP 
X 1 V- 

Phe 


AAC 
Asn 


GAC 
Asp 


TTP 

X X \~ 

Phe 


Val 


Leu 


APP 

Ser 


A Af 

Asn 


a Ar* 

AAL. 

Asn 


PTP 

Val 


GCC 

Ala 


GGC 
Gly 


CTG 
Leu 


GTG 
Val 


Val 


PAG 

Gin 


ppp 
Arg 


ppp 
Gly 


TPP 

Ser 


TPP 

Ser 


PAP 

Asp 


ATC 
He 


CTG 
Leu 


ATC 

He 


GAP 
Asp 


GGP 

Gly 


ppp 
Gly 


ppp 
Ala 


TAP 

Tyr 


TAP 

Tyr 


Asp 


CAG 
Gin 


ATC 
He 


AAG 
Lys 


ATG 
Met 


GCC 
Ala 


CAP 

His 


GAP 
Asp 


GTP 

OX 

Val 


APP 

Thr 


v- XV? 

Leu 


GGC 
Gly 


AAC 
Asn 


GGC 
Gly 


CTA 
Leu 


TAC 
Tyr 


GGG 
Gly 


GTG 
Val 


CGC 
Arg 


GTP 

Val 


TAP 

Tyr 


ATC 
He 


CTC 
Leu 


GAC 
Asp 


AAC 
Asn 


TAC 
Tyr 


ATC 
He 


CAC 
His 


GAP 
Asp 


AAT 
nn X 

Asn 


TPP 

Ser 


GAA 
Glu 


ATC 
He 


CTC 
Leu 


CTG 
Leu 


CAG 
Gin 


TCC 
Ser 


TAP 

X ttv. 

Tyr 


GAP 

Asp 


GAT 

Asp 


APP 

Thr 


TTC 
Phe 


TAC 
Tyr 


ACC 
Thr 


ACC 
Thr 


ACC 

Thr 


GGP 
Gly 


APP 

Thr 


TPP 

Trp 


ATP 
Alt 

He 


PA A 
bAA 

Glu 


TCG 
Ser 


GCC 
Ala 


AAC 
Asn 


TCC 
Ser 


ACC 
Thr 


TAT 
Tyr 


GGC 
Gly 


ATC 
He 


CAG 
Gin 


GAG 
Glu 


TAC 
Tyr 


AGC 
Ser 


AGC 
Ser 


CTC 
Leu 


TAC 
Tyr 


GCC 
Ala 


AAC 
Asn 


AGC 
Ser 


GTC 
Val 


AGC 
Ser 


GTG 
Val 


CGC 
Arg 


CTC 
Leu 


TAC 
Tyr 


GGC 
Gly 


GCC 
Ala 


AAC 
Asn 


TCC 
Ser 


GTC 
Val 


GTC 
Val 


GGC 
Gly 


CAG 
Gin 


CAG 
Gin 


GCG 
Ala 


ACC 
Thr 


CTC 
Leu 


GAA 
Glu 


GGC 
Gly 


ACG 
Thr 


GCC 
Ala 
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/ O i z 


GGC 

Gly 


AAC 

Asn 


GAC 

Asp 


ACG 
Thr 


CTT 
Leu 


GGC 
Gly 


Toon 


Leu 


UAL 

Asp 


GGC 

Gly 


7V IN/"* 

AAC 

Asn 


GAC 
Asp 


CGC 
Arg 


7968 


GAC 
Asp 


GGC 
Gly 


GGP 

Gly 


GPP 

Ala 


GGP 

Gly 


CGC 

Arg 


8016 


PTG 

•Leu 


Phe 


PGP 
L-L»L. 

Arg 


PTP 
GXC 

Val 


xCC 

Ser 


GCG 

Ala 


8064 


AGC 
Ser 


TTC 

Phe 


AAP 

Asn 


GAP 

Asp 


CTG 
Leu 


AxC 

lie 


8112 


GAC 
Asp 


CTG 
Leu 


TCP 

Ser 


GPG 

Ala 


Leu 


GGC 

Gly 


8160 


ACC 
Thr 


CTG 
Leu 


CTG 
Leu 


PTG 

Leu 


PAG 

Gin 


VjXC 

Val 




Ser 


C lu 

Leu 


GAG 

Glu 


GCG 

Ala 


GAT 

Asp 


GCC 
Ala 




ggp 
Gly 


Asn 


1 xC 

Phe 


GCC 
Ala 


GGC 

Gly 


CTG 
Leu 




ALL 

Thr 


GCC 

Ala 


Arc 
lie 


GAG 
Glu 


GGG 
Gly 


GAT 
Asp 




bLL 

Ala 


GCC 

Ala 


GAG 
Glu 


ACA 
Thr 


TTG 
Leu 


CTC 
Leu 


Oft V U 


Gly 


GCC 

Ala 


GGC 

Gly 


GAC 
Asp 


GAC 
Asp 


ATC 
He 


8448 


Thr 


Gly 


GGP 

Gly 


ppr* 

GCC 

Ala 


GGA 

Gly 


GCG 

Ala 


8496 


AGP 

Ser 


PAG 

Gin 


PGP 

Arg 


AAC 

Asn 


•pup 

1 AC 

Tyr 


GAC 

Asp 




gap 
Asp 


TTC 
Phe 


V»LAjf 

Ala 


GIG 

Val 


GGC 

Gly 


GAA 

Glu 




Thr 


Gly 


C XG 

Leu 


GGC 

Gly 


GAC 
Asp 


GGC 
Gly 


8£A0 


Ser 


GPP 
bLL 

Ala 


GPP 
uOL 

Gly 


GAC 

Asp 


CGC 

Arg 


ACC 

Thr 


oooo 


GGP 

Gly 


TUP 

Tyr 


PGf* 

Arg 


X X \- 

Phe 


GAG 

Glu 


TTT 
Phe 


o/Jb 


GGC 

Gly 


AAC 

Asn 


GAG 

Glu 


GAT 
Asp 


TTC 
Phe 


ATC 
He 


8784 


GAA 
Glu 


GGC 
Gly 


AGC 
Ser 


GCC 
Ala 


GGC 
Gly 


AAC 
Asn 


8832 


GTG 
Val 


ATC 
lie 


CAC 
His 


GGC 
Gly 


GGC 
Gly 


GGC 
Gly 


8880 


GCC 
Ala 


GAC 
Asp 


GTG 
Val 


TTC 
Phe 


CGC 
Arg 


TTT 
Phe 


8928 


GCC 
Ala 


AGC 
Ser 


TAT 
Tyr 


GCC 
Ala 


GAT 
Asp 


CTG 
Leu 
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GGC 
Gly 


AGC 
Ser 


GAC 
Asp 


GCC 
Ala 


CAC 
His 


GAG 
Glu 


ACG 
Thr 


CTG 
Leu 


CTC 
Leu 


GGG 
Gly 


CTG 
Leu 


AAC 
Asn 


GGC 
Gly 


GGC 
Gly 


GCC 
Ala 


GGC 
Gly 


AAC 
Asn 


GAC 
Asp 


ATC 
He 


CTC 
Leu 


GAC 
Asp 


AAC 
Asn 


CTG 
Leu 


ACC 
Thr 


GGC 
Gly 


GGC 
Gly 


GCG 
Ala 


GGC 
Gly 


GCC 
Ala 


GAC 
Asp 


CGC 
Arg 


ACC 
Thr 


GAC 
Asp 


AGC 
Ser 


TAC 
Tyr 


CGC 
Arg 


ACC 
Thr 


GAC 
Asp 


AGC 
Ser 


GCC 
Ala 


ACC 
Thr 


GAC 
Asp 


TTC 
Phe 


GAC 
Asp 


GCC 
Ala 


AGC 
Ser 


CAG 
Gin 


GAC 
Asp 


CGC 
Arg 


ATC 
He 


TTC 
Phe 


ACC 
Thr 


GGG 
Gly 


CTG 
Leu 


GGC 
Gly 


GAC 
Asp 


GGC 
Gly 


TAT 
Tyr 


AAC 
Asn 


GGC 
Gly 


AGC 
Ser 


GCC 
Ala 


GAC 
Asp 


GGC 
Gly 


AGC 
Ser 


CGC 
Arg 


ACC 
Thr 


TAT 
Tyr 


CTG 
Leu 


AAG 
Lys 


GAG 
Glu 


GGG 
Gly 


CGG 
Arg 


CGT 
Arg 


TTC 
Phe 


GAG 
Glu 


ATC 
He 


GCC 
Ala 


CTG 
Leu 


GAC 
Asp 


CTC 
Leu 


GGT 
Gly 


GCC 
Ala 


GGC 
Gly 


AAC 
Asn 


CTG 
Leu 


CTC 
Leu 


TTC 
Phe 


GAG 
Glu 


CGC 
Arg 


GCC 
Ala 


GGC 
Gly 


GAC 
Asp 


AAC 
Asn 


GCC 
Ala 


CTG 
Leu 


CTC 
Leu 


GGT 
Gly 


ACC 
Thr 


TCG 
Ser 


GGC 
Gly 


CAC 
His 


GCC 
Ala 


GGC 
Gly 


AAC 
Asn 


GAC 
Asp 


ACG 
Thr 


CTC 
Leu 


GAC 
Asp 


GGC 
Gly 


CTG 
Leu 


GTC 
Val 


GGC 
Gly 


GGC 
Gly 


GCC 
Ala 


GGG 
Gly 


CGC 
Arg 


GAC 
Asp 


AGC 
Ser 


CTC 
Leu 


GAC 
Asp 


GTG 
Val 


TTC 
Phe 


CGC 
Arg 


TTC 
Phe 


GAC 
Asp 


GCG 
Ala 


CTG 
Leu 


TCC 
Ser 


GAC 
Asp 


ATC 
He 


GGC 
Gly 


GAC 
Asp 


AAC 
Asn 


CAG 
Gin 


GGC 
Gly 


GAC 
Asp 


CGC 
Arg 


ATC 

He 


GCC 
Ala 


GAC 
Asp 


AAG 
Lys 


CTC 
Leu 


GAC 
Asp 


GTA 
Val 


TCG 
Ser 


GCG 
Ala 


CTG 
Leu 


GGC 
Gly 


TTC 
Phe 


TAC 
Tyr 


AAC 
Asn 


GGC 
Gly 


ACC 
Thr 


CTC 
Leu 


GCC 
Ala 


CTG 
Leu 


GTG 
Val 


CTC 
Leu 


AAC 
Asn 


TAC 
Tyr 


GTG 
Val 


AAA 

Lys 


AGC 
Ser 


TAC 
Tyr 


GAG 
Glu 


AAC 
Asn 


GGC 
Gly 


GCC 
Ala 


GAC 
Asp 


TCC 
Ser 


CTC 
Leu 


GAC 
Asp 


GGC 
Gly 


AAC 
Asn 


TAT 
Tyr 


CTG 
Leu 


GAG 
Glu 


CTA 
Leu 


CTC 
Leu 


TTC 
Phe 


GCC 
Ala 


ACG 
Thr 


CCC 
Pro 


AGC 
Ser 


GGC 
Gly 


CAG 
Gin 


CAA 
Gin 


CTC 
Leu 


CTC 
Leu 


GAC 
Asp 


AGC 
Ser 


CTG 
Leu 


CAG 
Gin 


GGC 
Gly 


ACG 
Thr 


GCC 
Ala 


GCC 
Ala 


GAC 
Asp 


GAG 
Glu 


GGG 
Gly 


CGC 
Arg 


GAC 
Asp 


ACG 
Thr 


CTG 
Leu 


GCC 
Ala 


GGA 
Gly 


GGG 
Gly 


GCC 
Ala 


GGG 
Gly 


AGC 
Ser 


GAA 
Glu 


CTG 
Leu 


ACC 
Thr 


GAC 
Asp 


AGC 
Ser 


TAC 
Tyr 


CGA 
Arg 


GAC 
Asp 


AGT 
Ser 


ATC 
He 


ACT 
Thr 


GAC 
Asp 


TTC 
Phe 


GAT 
Asp 


GCC 
Ala 


AGC 
Ser 


GAG 
Glu 


GAT 
Asp 


CGT 
Arg 
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8976 


ATC 
lie 


GAC 
Asp 


CTG 
Leu 


TCC 
Ser 


GGC 
Gly 


CTC 
Leu 


ririr' 
Gly 


*nY* incir* nri ,m > mY* 
Phe Ser Gly Leu 


GGC aal 
Gly Asn 


Gly 


1AL 

Tyr 


GGC 

Gly 


9024 


GGT 
Gly 


ACC 
Thr 


CTG 
Leu 


GCG 
Ala 


CTG 
Leu 


CAG 
Gin 


GTG 
Val 


AAC AGC GCC GGT 

Asn Ser Ala Gly 


ACC CGC 

Thr Arg 


ACC 
Thr 


TAC 

Tyr 


Leu 


9072 


AAG 
Lys 


AGC 
Ser 


TTC 
Phe 


GAG 
Glu 


ACC 
Thr 


AAC 
Asn 


GCC 
Ala 


GCC Gdc nhn r , n r p 
Ala Gly Glu Arg 


•pry* ri'h.n 

1 IV uAu 

Phe Glu 


A JA- 

He 


GCC 

Ala 


CiG 

Leu 


9120 


GAC 
Asp 


GGC 
Gly 


GAC 
Asp 


CTG 
Leu 


TCC 
Ser 


GCG 
Ala 


CTC 
Leu 


GGC f2fif2 nrr* sir 

^avanj uLL AAL 

Gly Gly Ala Asn 


C 1G A IV 

Leu He 


Leu 


GAL. 

Asp 


GCG 

Ala 


9168 


CGT 
Arg 


ACC 
Thr 


GTA 
Val 


CTG 
Leu 


GCG 
Ala 


GGC 
Gly 


GGC 
Gly 


V»/\V~ WV. A/vC Vj/iC 

Asp Gly Asn Asp 


ACG V X'X 

Thr Leu 


Ser 


GGG 
Gly 


AGC 

Ser 


9216 


AGC 
Ser 


GCG 
Ala 


GCC 
Ala 


GAG 
Glu 


GAA 
Glu 


CTG 
Leu 


CTC 
Leu 


GGC nfifi finv* nrr* 

vjVjO VslG GGC 

Gly Gly Val Gly 


AAC bAL 

Asn Asp 


»pp 

AGC 

Ser 


C 1G 

Leu 


GAC 
Asp 


9264 


GGC 
Gly 


GGC 
Gly 


GCC 
Ala 


GGC 
Gly 


AAC 
Asn 


Asp 


ATP 

lie 


P f TY > r*^r* /"W* /"W* 
V- XV OAC WjL GGG 

Leu Asp Gly Gly 


GCG GGG 

Ala Gly 


CGC 
Arg 


GAC 
Asp 


ACC 
Thr 


9312 


CTG 
Leu 


AGT 
Ser 


GGC 
Gly 


GGC 
Gly 


AGC 
Ser 


GGC 
Gly 


Ser 


PIP fcTY* a*TV* p/y 

WW- A IV 1 lu CGC 

Asp He Phe Arg 


TTC GGC 

Phe Gly 


GGC 
Gly 


GCG 
Ala 


CTC 
Leu 


9360 


GAC 
Asp 


AGC 
Ser 


TTC 
Phe 


Arg 


Asn 


Tin 

Tyr 


GGC 

Ala 


AGC GGG ACG AAC 
Ser Gly Thr Asn 


GGC ACC 
Gly Thr 


GAC 
Asp 


AGC 
Ser 


ATC 
He 


9408 


ACC 
Thr 


GAC 
Asp 


Phe 


Thr 


Pro 


Gly 


GAG 
Glu 


GAT CTG ATC GAC 
Asp Leu He Asp 


CTC TCC 
Leu Ser 


GTG 
Val 


CTC 
Leu 


GGC 
Gly 


V* JO 


Tyr 


Thr 


Gly 


GIG 

Leu 


GGC 

Gly 


GAC 
Asp 


GGC 
Gly 


TAC AAC GGT ACC 
Tyr Asn Gly Thr 


CTG GCG 
Leu Ala 


ATA 
He 


GTG 
Val 


CTG 
Leu 


9504 


AAC 
Asn 


GAC 
Asp 


GCC 
Ala 


GGC 
Gly 


ACC 
Thr 


AAG 
Lys 


ACC 
Thr 


TAC CTG AAA AAC 
Tyr Leu Lys Asn 


CGC GAG 
Arg Glu 


AGC 
Ser 


GAC 
Asp 


GCC 
Ala 


9552 


GAA 
Glu 


GGC 
Gly 


AAC 
Asn 


CAG 
Gin 


TTC 
Phe 


GAG 
Glu 


ATC 
lie 


GCC CTG GAG GGC 
Ala Leu Glu Gly 


AAC CAC 
Asn His 


GCC 
Ala 


GAC 
Asp 


CAG 
Gin 


9600 


CTC 
Leu 


GAT 
Asp 


GCG 
Ala 


AGC 
Ser 


GAC 
Asp 


TTC 
Phe 


ATC 
He 


TTC GCC ACG GCG 
Phe Ala Thr Ala 


GCC GCG 
Ala Ala 


ACC 
Thr 


ACC 
Thr 


GGA 
Gly 


9648 


ATC 
lie 


GAG 
Glu 


GTG 
Val 


GTC 
Val 


GGC 
Gly 


GGC 
Gly 


AGC 
Ser 


GGC ACC CAG ACC 
Gly Thr Gin Thr 


GAT CAG 
Asp Gin 


CTC 
Leu 


GCC 
Ala 


TGA 



9696 TCCGACCCCG CCCGCACCCG CCCGGCCATT CCGGCCGGGC GAACCAATGG CCTTTTGATC 

9756 AGTCTCAGGC ACAGCAACGT GTGCGCCGCT TCGCTTGTTC GCCCTCCCGG CCTTGTTTCT 

9816 CGCTGAAAGC GACGATCGCC GGGGGCGTGC CGGGCGCGAG AAAAGGTTCG CCGTGTGCAA 

9876 AGCCGGGGAC GGGAAAAGCC TGTTCAAGTA GTCGACTCTT CCTTCTCCTT TTTCCTAGAC 



9936 


GGCCTCT 


TGG 


CTG 


AGC 


ATT 


AAC 


GGA 


ACA 


GGA 


AGC 


AGC 


ATG 
Met 


GAC 
Asp 


TTC 
Phe 


AAC 
Asn 


9985 


GTC 
Val 


AAA 
Lys 


GAT 
Asp 


TTC 
Phe 


GGG 
Gly 


GCA 
Ala 


CTG 
Lieu 


GGC 
Gly 


GAT 
Asp 


GGC 
Gly 


GCC 
Ala 


AGC 
Ser 


GAC 
Asp 


GAC 
Asp 


ACG 
Thr 


GCG 
Ala 


10033 


GCC 
Ala 


ATC 
He 


CAG 
Gin 


GCG 
Ala 


GCG 
Ala 


ATC 
He 


GAT 
Asp 


GCC 
Ala 


GCC 
Ala 


CAC 
His 


GCG 
Ala 


rcG 

i.la 


GGC 
Gly 


GGC 
Gly 


GGC 
Gly 


ACC 
Thr 


10081 


GTC 
Val 


TAC 
Tyr 


CTG 
Leu 


CCG 
Pro 


GCT 
Ala 


GGC 
Gly 


GAG 
Glu 


TAT 
Tyr 


CGG 
Arg 


GTC 
Val 


AGC 
Ser 


( GC 
Gly 


GGC 
Gly 


GAG 
Glu 


GAG 
Glu 


CCT 
Pro 


10129 


TCC 
Ser 


GAC 
Asp 


GGC 
Gly 


GCG 
Ala 


CTG 
Leu 


ACC 
Thr 


ATC 
He 


AAG 
Lys 


AGC 
Ser 


AAC 
Asn 


GTC 
Val 


TAT 
Tyr 


ATC 
He 


GTC 
Val 


GGC 
Gly 


GCC 
Ala 


10177 


GGG 
Gly 


ATG 
Met 


GGC 
Gly 


GAG 
Glu 


ACG 
Thr 


GTG 
Val 


ATC 
He 


AAG 
Lys 


ATG 
Met 


GTC 
Val 


GAC 
Asp 


GGC 
Gly 


TGG 
Trp 


ACG 
Thr 


CAG 
Gin 


AAC 
Asn 


10225 


GTC 


ACC 


GGC 


ATG 


GTG 


CGC 


TCG 


GCC 


TAT 


GGC 


GAG 


GAG 


ACC 


AGC 


AAC 


TTC 
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